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(57) Abstract 



The present invention relates to the isolation, identification and characterization of nucleotide sequences encoding an enzyme catalysing 
the transfer of fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway for the production of triacylglycerol, to the said 
enzymes and a process for the production of triacylglycerols. 
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WO 00/60095 1 PCT/EP00/02701 

A NEW CLASS OF ENZYMES IN THE BIOSYNTHETIC PATHWAY FOR THE 
PRODUCTION OF TRIACYLGLYCEROL AND RECOMBINANT DNA 
MOLECULES ENCODING THESE ENZYMES 

5 The present invention relates to the isolation, identification and characterization 
of recombinant DNA molecules encoding enzymes catalysing the transfer of 
fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway for 
the production of triacylglycerol. 

10 Triacylglycerol (TAG) is the most common lipid-based energy reserve in nature. 
The main pathway for synthesis of TAG is believed to involve three sequential 
acyl-transfers from acyl-CoA to a glycerol backbone (1 , 2). For many years, 
acyl-CoA : diacylglycerol acyltransferase (DAGAT), which catalyzes the third 
acyl transfer reaction, was thought to be the only unique enzyme involved in 

is TAG synthesis. It acts by diverting diacylglycerol (DAG) from membrane lipid 
synthesis into TAG (2). Genes encoding this enzyme were recently identified 
both in the mouse (3) and in plants (4, 5), and the encoded proteins were 
shown to be homologous to acyl-CoA : cholesterol acyltransferase (ACAT). It 
was also recently reported that another DAGAT exists in the oleaginous fungus 

20 Mortierella ramanniana, which is unrelated to the mouse DAGAT, the ACAT 
gene family or to any other known gene (6). 

The instant invention relates to novel type of enzymes and their encoding 
genes for transformation. More specifically, the invention relates to use of a 

25 type of genes encoding a not previously described type of enzymes hereinafter 
designated phospholipid:diacylglycerol acyltransferases (PDAT), whereby this 
enzyme catalyses an acyl-CoA-independent reaction. The said type of genes 
expressed alone in transgenic organisms will enhance the total amount of oil 
(triacylglycerols) produced in the cells. The PDAT genes, in combination with a 

30 gene for the synthesis of an uncommon fatty acid will, when expressed in 
transgenic organisms, enhance the levels of the uncommon fatty acids in the 
triacylglycerols. 
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There is considerable interest world-wide in producing chemical feedstock, 
such as fatty acids, for industrial use from renewable plant resources rather 
than non-renewable petrochemicals. This concept has broad appeal to 
5 manufacturers and consumers on the basis of resource conservation and 
provides significant opportunity to develop new industrial crops for agriculture. 

There is a diverse array of unusual fatty acids in oils from wild plant species 
and these have been well characterised. Many of these acids have industrial 
10 potential and this has led to interest in domesticating relevant plant species to 
enable agricultural production of particular fatty acids. 

Development in genetic engineering technologies combined with greater 
understanding of the biosynthesis of unusual fatty acids now makes it possible 
is to transfer genes coding for key enzymes involved in the synthesis of a 
particular fatty acid from a wild species into domesticated oilseed crops. In this 
way individual fatty acids can be produced in high purity and quantities at 
moderate costs. 

20 In all crops like rape, sunflower, oilpalm etc., the oil (i.e. triacylglycerols) is the 
most valuable product of the seeds or fruits and other compounds like starch, 
protein, and fibre is regarded as by-products with less value. Enhancing the 
quantity of oil per weight basis at the expense of other compounds in oil crops 
would therefore increase the value of crop. If genes, regulating the allocation of 

25 reduced carbon into the production of oil can be up-regulated, the cells will 
accumulate more oil on the expense of other products. Such genes might not 
only be used in already high oil producing cells, such as oil crops, but could 
also induce significant oil production in moderate or low oil containing crops 
such as e.g. soy, oat, maize, potato, sugarbeats, and turnips as well as in 

30 micro-organisms. 
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Summary of the invention 

Many of the unusual fatty acids of interest, e.g. medium chain fatty acids, 
hydroxy fatty acids, epoxy fatty acids and acetylenic fatty acids, have physical 

5 properties that are distinctly different from the common plant fatty acids. The 
present inventors have found that, in plant species naturally accumulating 
these uncommon fatty acids in their seed oil (i.e. triacylglycerol), these acids 
are absent, or present in very low amounts in the membrane (phospho)lipids of 
the seed. The low concentration of these acids in the membrane lipids is most 

10 likely a prerequisite for proper membrane function and thereby for proper cell 
functions. One aspect of the invention is that seeds of transgenic crops can be 
made to accumulate high amounts of uncommon fatty acids if these fatty acids 
are efficiently removed from the membrane lipids and channelled into seed 
triacylglycerols. 

15 

The inventors have identified a novel class of enzymes in plants catalysing the 
transfer of fatty acids from phospholipids to diacylglycerol in the production of 
triacylglycerol through an acyl-CoA-independent reaction and that these 
enzymes (phospholipididiacylglycerol acyltransf erases, abbreviated as PDAT) 
20 are involved in the removal of hydroxylated, epoxygenated fatty acids, and 
probably also other uncommon fatty acids such as medium chain fatty acids, 
from phospholipids in plants. 

This enzyme reaction was shown to be present in microsomal preparations 
from baker's yeast (Saccharomyces cerevisiae). The instant invention further 

25 pertains to an enzyme comprising an amino acid sequence as set forth in SEQ 
ID No. 2 or a functional fragment, derivate, allele, homolog or isoenzyme 
thereof. A so called , knock out 4 yeast mutant, disrupted in the respective gene 
was obtained and microsomal membranes from the mutant was shown to 
totally lack PDAT activity. Thus, it was proved that the disrupted gene encodes 

30 a PDAT enzyme (SEQ ID NO. 1 and 2). Furtherm, this PDAT enzyme is 
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characterized through the amino acid sequence as set forth in SEQ ID NO 2 
containing a lipase motif of the conserved sequence string FXKWVEA. 
The instant invention pertains further to an enzyme comprising an amino acid 
sequence as set forth in SEQ ID NO. 1a, 2b or 5a or a functional fragment, 

5 derivate, allele, homolog or isoenzyme thereof. 

Further genes and/or proteins of so far unknown function were identified and 
are contemplated within the scope of the instant invention. A gene from 
Schizosaccharomyces pombe, SPBC776.14 (SEQ ID. NO. 3), a putative open 
reading frame CAA22887 of the SPBC776.14 (SEQ ID NO. 13) were identified. 

10 Further Arabidopsis thaliana genomic sequences (SEQ ID NO. 4, 10 and 11) 
coding for putative proteins were identified, as well as a putative open reading 
frame AAC80628 from the A. thaliana locus AC 004557 (SEQ ID NO. 14) and a 
putative open reading frame AAD 10668 from the A. thaliana locus AC 003027 
(SEQ ID NO. 15) were identified. 

15 Also, a partially sequenced cDNA clone from Neurospora crassa (SEQ ID NO. 
9) and a Zea mays EST (Extended Sequence Tac) clone (SEQ ID NO. 7) and 
corresponding putative amino acid sequence (SEQ ID NO. 8) were identified. 
Finally, two cDNA clones were identified, one Arabidopsis thaliana EST (SEQ 
ID NO. 5 and corresponding predicted amino acid sequence SEQ ID NO. 6) 

20 and a Lycopersicon esculentum EST clone (SEQ ID NO. 12) were identified. 
Further, enzymes designated as PDAT comprising an amino acid sequence 
selected from the group consisting of sequences as set forth in SEQ ID NO 2a, 
3a, 5b, 6 or 7b containing a lipase motif FXKWVEA are contemplated within the 
scope of the invention. Moreover, an enzyme comprising an amino acid 

25 sequence encoded through a nucleotide sequence, a portion, derivate, allele or 
homolog thereof selected from the group consisting of sequences as set forth 
in SEQ ID No. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7. 8b, 9, 9b, 10, 10b, 1 1 , 1 1b or 
12 or a functional fragment, derivate, allele, homolog or isoenzyme of the 
enzyme encoding amino acid sequence are included within the scope of the 

30 invention. 
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A functional fragment of the instant enzyme is understood to be any 
polypeptide sequence which shows specific enzyme activity of a 
phospholipid:diacylglycerol acyltransf erase (PDAT). The length of the functional 
fragment can for example vary in a range from about 660 ±10 amino acids to 

5 660 ± 250 amino acids, preferably from about 660 ± 50 to 660 ±100 amino 
acids, whereby the „basic number" of 660 amino acids corresponds in this case 
to the polypeptide chain of the PDAT enzyme of SEQ ID NO. 2 encoded by a 
nucleotide sequence according to SEQ ID NO- 1. Consequently, the „basic 
number" of functional fullength enzyme can vary in correspondance to the 

10 encoding nucleotide sequence. 

A portion of the instant nucleotide sequence is meant to be any nucleotide 
sequence encoding a polypeptid which shows specific activity of a 
phospholipid:diacylglycerol acyltransferase (PDAT). The length of the 
nucleotide portion can vary in a wide range of about several hundreds of 

15 nucleotides based upon the coding region of the gene or a highly conserved 
sequence. For example the length varies in a range form about 1900 ± 10 to 
1900 ± 1000 nucleotides, preferably form about 1900 ± 50 to 1900 ±700 and 
more preferably form about 1900 ± 100 to 1900 ± 500 nucleotides, whereby the 
„basic number" of 1900 nucleotides corresponds in this case to the encoding 

20 nucleotide sequence of the PDAT enzyme of SEQ ID NO. 1. Consequently, the 
„basic number" of functional fullength gene can vary. 

An allelic variant of the instant nucleotide sequence is understood to be any 
different nucleotide sequence which encodes a polypeptide with a functionally 

25 equivalent function. The alleles pertain naturally occuring variants of the instant 
nucleotide sequences as well as synthetic nucleotide sequences produced by 
methods known in the art. Contemplated are even altered nucleotide 
sequences which result in an enzyme with altered activity and/or regulation or 
which is resistant against specific inhibitors. The instant invention further 

30 includes natural or synthetic mutations of the originally isolated nucleotide 
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sequences. These mutations can be substitution, addition, deletion, inversion 
or insertion of one or more nucleotides. 

A homologous nucleotide sequence is understood to be a complementary 
5 sequence and/or a sequence which specifically hybridizes with the instant 
nucleotide sequence. Hybridizing sequences include similar sequences 
selected from the group of DNA or RNA which specifically interact to the instant 
nucleotide sequences under at least moderate stringency conditions which are 
known in the art. A preferred, non-limiting example of stringent hybridization 
to conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at 
about 45°C, followed by one or more washes in 0.2 X SSC, 0.1% SDS at 50- 
65°C. This further includes short nucleotide sequences of e.g. 10 to 30 
nucleotides, preferably 12 to 15 nucleotides. Included are also primer or 
hybridization probes. 

15 

A homologous nucleotide sequence included within the scope of the instant 
invention is a sequence which is at least about 40%, preferably at least about 
50 % or 60%, and more preferably at least about 70%, 80% or 90% and most 
preferably at least about 95%, 96%, 97%, 98% or 99% or more homologous to 
20 a nucleotide sequence of SEQ ID NO. 1 . 

All of the aforementioned definitions are true for amino acid seqences and 
functional enzymes and can easily transfered by a person skilled in the art. 

Isoenzymes are understood to be enzymes which have the same or a similar 
25 substrate specifity and/or catalytic activity but a different primary structure. 

In a first embodiment, this invention is directed to nucleic acid sequences that 
encode a PDAT. This includes sequences that encode biologically active 
PDATs as well as sequences that are to be used as probes, vectors for 
30 transformation or cloning intermediates. The PDAT encoding sequence may 
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encode a complete or partial sequence depending upon the intended use. All or 
a portion of the genomic sequence, cDNA sequence, precursor PDAT or 
mature PDAT is intended. 

s Further included is a nucleotide sequence selected from the group consisting of 
sequences set forth in SEQ ID No. 1 , 1b, 3, 3b, 4, 4a, 4b, 9b, 10, 10b or 1 1 or a 
portion, derivate, allele or homolog thereof. The invention pertains a partial 
nucleotide sequence corresponding to a fullength nucleotide sequence selected 
from the group consisting of sequences set forth in SEQ ID No. 5, 5b, 6b, 7, 8b, 

io 9, 11b or 12 or a portion, derivate, allele or homolog thereof. Moreover, a 
nucleotide sequence comprising a nucleotide sequence which is at least 40% 
homologous to a nucleotide sequence selected form the group consisting of 
those sequences set forth in SEQ ID No. 1 1 b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 
9, 9b, 10, 10b, 11, 1 1b or 12 is contemplated within the scope of the invention. 

15 

The instant invention pertains to a gene construct comprising a said nucleotide 
sequences of the instant invention which is operably linked to a heterologous 
nucleic acid. 

The term operably linked means a serial organisation e.g. of a promotor, coding 
20 sequence, terminator and/or further regulatory elements whereby each element 
can fulfill its original function during expression of the nucleotide sequence. 

Further, a vector comprising of a said nucleotide sequence of the instant 
invention is contemplated in the instant invention. This includes also an 
25 expression vector as well as a vector further comprising a selectable marker 
gene and/or nucleotide sequences for the replication in a host cell and/or the 
integration into the genome of the host cell. 

In a different aspect, this invention relates to a method for producing a PDAT in 
30 a host cell or progeny thereof, including genetically engineered oil seeds, yeast 
and moulds or any other oil accumulating organism, via the expression of a 
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construct in the cell. Cells containing a PDAT as a result of the production of 
the PDAT encoding sequence are also contemplated within the scope of the 
invention. 

5 Further, the invention pertains a transgenic cell or organism containing a said 
nucleotide sequence and/or a said gene construct and/or a said vector. The 
object of the instant invention is further a transgenic cell or organism which is 
an eucaryotic cell or organism. Preferably, the transgenic cell or organism is a 
yeast cell or a plant cell or a plant. The instant invention further pertains said 

10 transgenic cell or organism having an altered biosynthetic pathway for the 
production of triacylglycerol. A transgenic cell or organism having an altered oil 
content is also contemplated within the scope of this invention. 

Further, the invention pertains a transgenic cell or organism wherein the activity 
is of PDAT is altered in said cell or organism. This altered activity of PDAT is 
characterized by an alteration in gene expression, catalytic activity and/or 
regulation of activity of the enzyme. Moreover, a transgenic cell or organism is 
included in the instant invention, wherein the altered biosynthetic pathway for 
the production of triacylglycerol is characterized by the prevention of 
20 accumulation of undesirable fatty acids in the membrane lipids. 

In a different embodiment, this invention also relates to methods of using a 
DNA sequence encoding a PDAT for increasing the oil-content within a cell. 

25 Another aspect of the invention relates to the accommodation of high amounts 
of uncomman fatty acids in the triacylglycerol produced within a cell, by 
introducing a DNA sequence producing a PDAT that specifically removes these 
fatty acids from the membrane lipids of the cell and channel them into 
triacylglycerol. Plant cells having such a modification are also contemplated 

30 herein. 
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Further, the invention pertains a process for the production of triacylglyceroi, 
comprising growing a said transgenic cell or organism under conditions 
whereby the said nucleotide sequence is expressed and whereby the said 
transgenic cells comprising a said enzyme catalysing the transfer of fatty acids 
5 from phospholipids to diacylglycerol forming triacylglyceroi. 

Moreover, triacylglycerols produced by the aforementioned process are 
included in scope of the instant invention. 

10 Object of the instant invention is further the use of an instant nucleotide 
sequence and/or a said enzyme for the production of triacylglyceroi and/or 
triacylglycerols with uncommon fatty acids. The use of a said instant nucleotide 
sequence and/or a said enzyme of the instant invention for the transformation 
of any cell or organism in order to be expressed in this cell or organism and 

is result in an altered, preferably increased oil content of this cell or organism is 
also contemplated within the scope of the instant invention, 

A PDAT of this invention includes any sequence of amino acids, such as a 
protein, polypeptide or peptide fragment obtainable from a microorganism, 
20 animal or plant source that demonstrates the ability to catalyse the production 
of triacylglyceroi from a phospholipid and diacylglycerol under enzyme reactive 
conditions. By „enzyme reactive conditions" is meant that any necessary 
conditions are available in an environment (e.g., such factors as temperature, 
pH, lack of inhibiting substances) which will permit the enzyme to function. 

25 

Other PDATs are obtainable from the specific sequences provided herein. 
Furthermore, it will be apparent that one can obtain natural and synthetic 
PDATs, including modified amino acid sequences and starting materials for 
synthetic-protein modelling from the examplified PDATs and from PDATs which 
30 are obtained through the use of such examplified sequences. Modified amino 
acid sequences include sequences that have been mutated, truncated, 
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increased and the like, whether such sequences were partially or wholly 
synthesised. Sequences that are actually purified from plant preparations or 
are identical or encode identical proteins thereto, regardless of the method 
used to obtain the protein or sequence, are equally considered naturally 
5 derived. 

Further, the nucleic acid probes (DNA and RNA) of the present invention can 
be used to screen and recover homologous" or „related" PDATs from a variety 
of plant and microbial sources. 

10 

Further, it is also apparent that a person skilled in the art can, with the 
information provided in this application, in any organism identify a PDAT 
activity, purify an enzyme with this activity and thereby identify a „non- 
homologous" nucleic acid sequence encoding such an enzyme. 

15 

The present invention can be essentially characterized by the following 
aspects: 

1 . Use of a PDAT gene (genomic clone or cDNA) for transformation. 
20 2. Use of a DNA molecule according to item 1 wherein said DNA is used for 
transformation of any organism in order to be expressed in this organism 
and result in an active recombinant PDAT enzyme in order to increase oil 
content of the organism. 

3. Use of a DNA molecule of item 1 wherein said DNA is used for 
25 transformation of any organism in order to prevent the accumulation of 

undesirable fatty acids in the membrane lipids. 

4. Use according to item 1 , wherein said PDAT gene is used for transforming 
transgenic oil accumulating organisms engineered to produce any 
uncommon fatty acid which is harmful if present in high amounts in 

30 membrane lipids, such as medium chain fatty acids, hydroxylated fatty 
acids, epoxygenated fatty acids and acetylenic fatty acids. 



SUBSTITUTE SHEET (RULE 26) 



.1 

WO 00/60095 



11 



5. Use according to item 1 , wherein said PDAT gene is used for transforming 
organisms, and wherein said organisms are crossed with other oil 
accummulating organisms engineered to produce any uncommon fatty acid 
which is harmful if present in high amounts in membrane lipids, comprising 

5 medium chain fatty acids, hydroxylated fatty acids, epoxygenated fatty 
acids and acetylenic fatty acids. 

6. Use according to item 1 , wherein the enzyme encoded by said PDAT gene 
or cDNA is coding for a PDAT with distinct acyl specificity. 

7. Use according to item 1 wherein said PDAT encoding gene or cDNA, is 
10 derived from Saccharomyces cereviseae, or contain nucleotide sequences 

coding for an amino acid sequence 30% or more identical to the amino acid 
sequence of PDAT as presented in SEQ. ID. NO. 2. 

8. Use according to item 1 wherein said PDAT encoding gene or cDNA is 
derived from Saccharomyces cereviseae, or contain nucleotide sequences 

15 coding for an amino acid sequence 40% or more identical to the amino acid 
sequence of PDAT as presented in SEQ. ID. NO. 2. 

9. Use according to item 1 wherein said PDAT encoding gene or cDNA is 
derived from Saccharomyces cereviseae, or contain nucleotide sequences 
coding for an amino acid sequence 60% or more identical to the amino acid 

20 sequence of PDAT as presented in SEQ. ID. NO. 2. 

10. Use according to item 1 wherein said PDAT encoding gene or cDNA is 
derived from Saccharomyces cereviseae, or contain nucleotide sequences 
coding for an amino acid sequence 80% or more identical to the amino acid 
sequence of PDAT as presented in SEQ. ID. NO. 2. 

25 11. Use according to item 1 wherein said PDAT encoding gene or cDNA is 
derived from plants or contain nucleotide sequences coding for an amino 
acid sequence 40% or more identical to the amino acid sequence of PDAT 
from Arabidopsis thaliana or to the protein encoded by the fullength 
counterpart of the partial Zea mays, Lycopericon esculentum, or 

30 Neurospora crassa cDNA clones. 
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12. Transgenic oil accumulating organisms comprising, in their genome, a 
PDAT gene transferred by recombinant DNA technology or somatic 
hybridization. 

13. Transgenic oil accumulating organisms according to item 12 comprising, in 
5 their genome, a PDAT gene having specificity for substrates with a 

particular uncommon fatty acid and the gene for said uncommon fatty acid. 

14. Transgenic organisms according to item 12 or 13 which are selected from 
the group consisting of fungi, plants and animals. 

15. Transgenic organisms according to item 12 or 13 which are selected from 
10 the group of agricultural plants. 

16. Transgenic organisms according to item 12 or 13 which are selected from 
the group of agricultural plants and where said PDAT gene is expressed 
under the control of a storage organ specific promotor. 

17. Transgenic organisms according to item 12 or 13 which are selected from 
15 the group of agricultural plants and where said PDAT gene is expressed 

under the control of a seed promotor. 

18. Oils from organisms according to item 12-17. 

19. A method for altering acyl specificity of a PDAT by alteration of the 
nucleotide sequence of a naturally occurring encoding gene and as a 

20 consequence of this alternation creating a gene encoding for an enzyme 
with novel acyl specifity. 

20. A protein encoded by a DNA molecule according to item 1 or a functional 
fragment thereof. 

21 . A protein of item 20 designated phospholipididiacylglycerol acyltransferase. 
25 22. A protein of item 21 which has a distinct acyl specificity. 

23. A protein of item 13 having the amino acid sequence as set forth in SEQ, 
ID NO. 2, 13, 14 or 15 (and the proteins encoded by the fuilength or partial 
genes set forth in SEQ. ID. NO. 1, 3, 4, 5, 7, 9, 10, 11 or 12) or an amino 
acid sequence with at least 30 % homology to said amino acid sequence. 
30 24. A protein of item 23 isolated from Saccharomyces cereviseae. 
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General methods: 

Yeast strains and plasmids . The wild type yeast strains used were either 
FY1679 (MATa his3-A200 Ieu2-Al trp1-A6 ura3-52) or W303-1A (MATa ADE2- 
1 can1-100 his3-11,15 Ieu2-3,112 trp1-1 ura3-1) (7). The YNR008w::KanMX2 
disruption strain FVKT004-04C(AL), which is congenic to FY1679, was 
obtained from the Euroscarf collection (8). A 2751 bp fragment containing the 
YNR008w gene with 583 bp of 5' and 183 bp of 3' flanking DNA was amplified 
from W303-1A genomic DNA using Taq polymerase with 5'- 
TCTCCATCTTCTGCAAAACCT-3' and 5'-CCTGTCAAAAACCTTCTCCTC-3' as 
primers. The resulting PCR product was purified by agarose gel electrophoresis 
and cloned into the EcoRV site of pBluescript (pbluescript-pdat). For 
complementation experiments, the cloned fragment was released from 
pBluescript by H/ndlll-SacI digestion and then cloned between the H/ndlll and 
Sad sites of pFL39 (9), thus generating pUS1 . For overexpression of the PDAT 
gene, a 2202 bp EcoRI fragment from the pBluscript plasmid which contains 
only 24 bp of 5' flanking DNA was cloned into the BamHI site of the GAL1- 
TPK2 expression vector pJN92 (12), thus generating pUS4. 

Microsomal preparations. Microsomes from developing seeds of sunflower 
(Helianthus annuus), Ricinus communis and Crepis palaestina were prepared 
using the procedure of Stobart and Stymne (11). To obtain yeast microsomes, 
1g of yeast cells (fresh weight) was re-suspended in 8 ml of ice-cold buffer (20 
mM Tris-CI, pH 7.9, 10 mM MgCI 2 , 1 mM EDTA, 5 % (v/v) glycerol, 1 mM DTT, 
0.3 M ammonium sulfate) in a 12 ml glass tube. To this tube, 4 ml of glass 
beads (diameter 0.45-0.5 mm) were added, and the tube was then heavily 
shaken (3 x 60 s) in an MSK cell homogenizer (B. Braun Melsungen AG, 
Germany). The homogenized suspension was centrifuged at 20,000 x g for 15 
min at 6°C and the resulting supernatant was again centrifuged at 100,000 x g 
for 2 h at 6°C. The 100,000 x g pellet was resuspended in 0.1 M potassium 
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phosphate (pH 7.2), and stored at -80°C. It is subsequently referred to as the 
crude yeast microsomal fraction. 

Lipid substrates. Radio-labeled ricinoleic (12-hydroxy-9-octadecenoic) and 
5 vernolic (12,13-epoxy-9-octadecenoic) acids were synthesized enzymatically 
from [1- 14 C]oleic acid and [1- 14 C]linoleic acid, respectively, by incubation with 
microsomal preparations from seeds of Ricinus communis and Crepis 
palaestina, respectively (12). The synthesis of phosphatidylcholines (PC) or 
phosphatidylethanolamines (PE) with 14 C-labeled acyl groups in the sn-2 
10 position was performed using either enzymatic (13), or synthetic (14) acylation 
of [ 14 C]oleic, [ 14 C]ricinoleic, or [ 14 C]vernolic acid. Dioleoyl-PC that was labeled 
in the s/>1 position was synthesized from sn-1 -[ 14 C]oleoyl-lyso-PC and 
unlabeled oleic acid as described in (14). S/>1-oleoyl-sn-2-[ 14 C]ricinoleoyl-DAG 
was synthesized from PC by the action of phospholipase C type XI from B. 
15 Cereus (Sigma Chemical Co.) as described in (15). Monovemoloyl- and 
divemoleoyl-DAG were synthesized from TAG extracted from seeds of 
Euphorbia lagascae, using the TAG-Hpase (Rizhopus arrhizus, Sigma Chemical 
Co.) as previously described (16). Monoricinoleoyl-TAG was synthesized 
according to the same method using TAG extracted from Castor bean. 

20 

Lipid analysis. .- .Total lipid composition of yeast were determined from cells 
harvested from a 40 ml liquid culture, broken in a glass-bead shaker and 
extracted into chloroform as described by Bligh and Dyer (17), and then 
separated by thin layer chromatography in hexane/diethylether/acetic acid 

25 (80:20:1) using pre-coated silica gel 60 plates (Merck). The lipid areas were 
located by brief exposure to l 2 vapors and identified by means of appropriate 
standards. Polar lipids, sterol-esters and triacylglycerols, as well as the 
remaining minor lipid classes, referred to as other lipids, were excised from the 
plates. Fatty acid methylesters were prepared by heating the dry excised 

30 material at 85 °C for 60 min in 2% (v/v) sulfuric acid in dry methanol. The 
methyl esters were extracted with hexane and analyzed by GLC through a 50 m 
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x 0.32 mm CP-Wax58-CB fused-silica column (Chrompack), with 
methylheptadecanoic acid as an internal standard. The fatty acid content of 
each fraction was quantified and used to calculate the relative amount of each 
lipid class. In order to determine the total lipid content, 3 ml aliquots from yeast 
5 cultures were harvested by centrifugation and the resulting pellets were washed 
with distilled water and lyophilized. The weight of the dried cells was 
determined and the fatty acid content was quantified by GLC-analyses after 
conversion to methylesters as described above. The lipid content was then 
calculated as nmol fatty acid (FA) per mg dry weight yeast. 

io 

Enzyme assays. Aliquots of crude microsomal fractions (corresponding to 
10 nmol of microsomal PC) from developing plant seeds or yeast cells were 
lyophilized over night. 14 C-Labeied substrate lipids dissolved in benzene were 
then added to the dried microsomes. The benzene was evaporated under a 

15 stream of N2, leaving the lipids in direct contact with the membranes, and 0.1 
ml of 50 mM potassium phosphate (pH 7.2) was added. The suspension was 
thoroughly mixed and incubated at 30°C for the time period indicated, up to 90 
min. Lipids were extracted from the reaction mixture using chloroform and 
separated by thin layer chromatography in hexane/diethylether/acetic acid 

20 (35:70:1.5) using silica gel 60 plates (Merck). The radioactive lipids were 
visualized and quantified on the plates by electronic autoradiography (Instant 
Imager, Packard, US). 

Yeast cultivation. Yeast cells were grown at 28°C on a rotatory shaker in 
25 liquid YPD medium (1% yeast extract, 2% peptone, 2% glucose), synthetic 
medium (18) containing 2% (v/v) glycerol and 2% (v/v) ethanol, or minimal 
medium (19) containing 16 g/l of glycerol. 

The instant invention is further characterized by the following examples which 
30 are not limiting: 
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Acvl-CoA-indeoendent synthesis of TAG by oil seed microsomes. A large 
number of unusual fatty acids can be found in oil seeds (20). Many of these 
fatty acids, such as ricinoleic (21) and vemolic acids (22), are synthesized 
using phosphatidylcholin (PC) with oleoyl or linoleoyl groups esterified to the 

5 s/>2 position, respectively, as the immediate precursor. However, even though 
PC can be a substrate for unusual fatty acid synthesis and is the major 
membrane lipids in seeds, unusual fatty acids are rarely found in the 
membranes. Instead, they are mainly incorporated into the TAG. A mechanism 
for efficient and selective transfer of these unusual acyl groups from PC into 

10 TAG must therefore exist in oil seeds that accumulate such unusual fatty acids. 
This transfer reaction was biochemically characterized in seeds from castor 
bean (Ricinus communis) and Crepis palaestina, plants which accumulate high 
levels of ricinoleic and vemolic acid, respectively, and sunflower (Helianthus 
annuus), a plant which has only common fatty acids in its seed oil. Crude 

15 microsomal fractions from developing seeds were incubated with PC having 
14 C-labeled oleoyl, ricinoieoyl or vemoloyl groups at the s/7-2 position. After the 
incubation, lipids were extracted and analyzed by thin layer chromatography. 
We found that the amount of radioactivity that was incorporated into the neutral 
lipid fraction increased linearly over a period of 4 hours (data not shown). The 

20 distribution of [ 14 C]acyl groups within the neutral lipid fraction was analyzed 
after 80 min (Fig. 1). Interestingly the amount and distribution of radioactivity 
between diffferent neutral lipids were strongly dependent both on the plant 
species and on the type of [ 14 C]acyl chain. Thus, sunflower microsomes 
incorporated most of the label into DAG, regardless of the type of [ 14 C]acyl 

25 group. In contrast, ft communis microsomes preferentially incorporated 
[ 14 C]ricinoleoyl and [ 14 C]vernoloyl groups into TAG, while [ 14 C]oleyl groups 
mostly were found in DAG. C. palaestina microsomes, finally, incorporated only 
[ 14 C]vemolyol groups into TAG, with [ 14 C]ricinoleyl groups being found mostly 
as free fatty acids, and [ 14 C]oleyl groups in DAG. This shows that the high in 

30 vivo levels of ricinoleic acid and vemolic acid in the TAG pool of ft communis 
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and C. palaestina, respectively, can be explained by an efficient and selective 
transfer of the corresponding acyl groups from PC to TAG in these organisms. 

The in-vitro synthesis of triacylglycerols in microsomal preparations of 
5 developing castor bean is summarized in table 1 . 

PDAT: a novel enzyme that catalyzes acvl-CoA independent synthesis of 
TAG. It was investigated if DAG could serve both as an acyl donor as well as 
an acyl acceptor in the reactions catalyzed by the oil seed microsomes. 

10 Thererfore, unlabeled divernoloyl-DAG was incubated with either sn-1 -oleoyl- 
sn-2-[ 14 C]ricinoleoyl-DAG or sn-1-oleoyi-sn-2-[ 14 C]ricinoleoyl-PC in the 
presence of ft communis microsomes. The synthesis of TAG molecules 
containing both [ u C]ricinoleoyl and vernoloyl groups was 5 fold higher when 
[ 14 C]ricinoleoyl-PC served as acyl donor as compared to [ 14 C]ricinoleoyl-DAG 

15 (fig.1B). These data strongly suggests that PC is the immediate acyl donor and 
DAG the acyl acceptor in the acyl-CoA-independent formation of TAG by oil 
seed microsomes. Therefore, this reaction is catalyzed by a new enzyme which 
we call phospholipid : diacylglycerol acyltransf erase (PDAT). 

20 PDAT activity in yeast microsomes. Wild type yeast cells were cultivated 

under conditions where TAG synthesis is induced. Microsomal membranes 
were prepared from these cells and incubated with s/>2-[ 14 C]-ricinoleoyl-PC 
and DAG and the 14 C-labeled products formed were analyzed. The PC-derived 
[ 14 C]ricinoleoyl groups within the neutral lipid fraction mainly were found in free 

25 fatty acids or TAG, and also that the amount of TAG synthesized was 
dependent on the amount of DAG that was added to the reaction (Fig.2). The in 
vitro synthesis of TAG containing both ricinoleoyl and vernoloyl groups, a TAG 
species not present in vivo, from exogenous added s/>2-[ 14 C]ricinoleoyl-PC and 
unlabelled vernoloyl-DAG (Fig. 2, lane 3) clearly demonstrates the existence of 

30 an acyl-CoA-independent synthesis of TAG involving PC and DAG as 
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substrates in yeast microsomal membranes. Consequently. TAG synthesis in 
yeast can be catalyzed by an enzyme similar to the PDAT found in plants. 

The PDAT encoding aene in veast 

5 A gene in the yeast genome (YNR008w) is known, but nothing is known about 
the function of YNR008w, except that the gene is not essential for growth under 
normal circumstances. Microsomal membranes were prepared from the yeast 
strain FVKT004-04C(AL) (8) in which this gene with unknown function had been 
disrupted. PDAT activity in the microsomes were assayed using PC with 

10 radiolabeled fatty acids at the sn-2 position. The activity was found to be 
completely absent in the disruption strain (Fig. 2 lane 4). Significantly, the 
activity could be partially restored by the presence of YNR008w on the single 
copy plasmid pUS1 (Fig. 2 lane 5). Moreover, acyl groups of 
phosphatidylethanolamine (PE) were efficiently incorporated into TAG by 

is microsomes from the wild type strain whereas no incorporation occured from 
this substrate in the mutant strain (data not shown). This shows that YNR008w 
encodes a yeast PDAT which catalyzes the transfer of an acyl group from the 
sn-2 position of phospholipids to DAG, thus forming TAG. It should be noted 
that no cholesterol esters were formed from radioactive PC even in incubations 

20 with added ergosterols, nor were the amount of radioactive free fatty acids 
formed from PC affected by disruption of the YNR008w gene (data not shown). 
This demonstrates that yeast PDAT do not have cholesterol ester synthesising 
or phosphoiipase activities. 

25 Increased TAG content in veast cells that overexpress PDAT . The effect 

of overexpressing the PDAT-encoding gene was studied by transforming a wild 
type yeast strain with the pUS4 plasmid in which the gene is expressed from 
the galactose-induced GAL1:TPK2 promoter. Cells containing the empty 
expression vector were used as a control. The cells were grown in synthetic 

30 glycerol-ethanol medium, and expression of the gene was induced after either 2 
hours (early log phase) or 25 hours (stationary phase) by the addition of 
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galactose. The cells were then incubated for another 21 hours, after which they 
were harvested and assays were performed. We found that overexpression of 
PDAT had no significant effect on the growth rate as determined by the optical 
density. However, the total lipid content, measured as \imo\ fatty acids per mg 

5 yeast dry weight, was 47% (log phase) or 29% (stationary phase) higher in the 
PDAT overexpressing strain than in the control. Furthermore, the polar lipid and 
sterolester content was unaffected by overexpression of PDAT. Instead, the 
elevated lipid content in these cells is entirely due to an increased TAG content 
(Fig. 3A,B). Thus, the amount of TAG was increased by 2-fold in PDAT 

io overexpressing early log phase cells and by 40% in stationary phase cells. It is 
interesting to note that a significant increase in the TAG content was achieved 
by overexpressing PDAT even under conditions (i.e. in stationary phase) where 
DAGAT is induced and thus contributes significantly to TAG synthesis. In vitro 
PDAT activity assayed in microsomes from the PDAT overexpressing strain 

15 was 7-fold higher than in the control strain, a finding which is consistent with the 
increased levels of TAG that we observed in vivo (Fig. 3C). These results 
clearly demonstrate the potential use of the PDAT gene in increasing the oil 
content in transgenic organisms. 

20 Substrate specificity of veast PDAT The substrate specificity of yeast 

PDAT was analyzed using microsomes prepared from the PDAT 
overexpressing strain (see Fig. 4). The rate of TAG synthesis, under conditions 
given in figure 4 with di-oleoyl-PC as the acyl-donor, was 0.15 nmoi per min 
and mg protein. With both oleoyl groups of PC labeled it was possible, under 

25 the given assay conditions, to detect the transfer of 1 1 pmol/min of [ 14 C]oieoyl 
chain into TAG and the formation of 15 pmol/min of lyso-PC. In microsomes 
from the PDAT-deficient strain, no TAG at ail and only trace amounts of lyso- 
PC was detected, strongly suggesting that yeast PDAT catalyses the formation 
of equimolar amounts of TAG and lyso-PC when supplied with PC and DAG as 

30 substrates. The fact that somewhat more lyso-PC than TAG is formed can be 
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explained by the presence of a phospholipase in yeast microsomes, which 
produces lyso-PC and unesterified fatty acids from PC. 

The specificity of yeast PDAT for different acyl group positions was 

5 investigated by incubating the microsomes with di-oleoyl-PC carrying a 
[ u C]acyl group either at the sn-1 position (Fig. 4A bar 2) or the sn-2 position 
(Fig. 4A bar 3). We found that the major 14 C-labeled product formed in the 
former case was lyso-PC, and in the latter case TAG. We conclude that yeast 
PDAT has a specificity for the transfer of acyl groups from the sn-2 position of 

10 the phospholipid to DAG, thus forming sn-1 -lyso-PC and TAG. Under the given 
assay conditions, trace amounts of 14 C-labelled DAG is formed from the sn-1 
labeled PC by the reversible action of a CDP-choline choline 
phosphotransferase. This labeled DAG can then be further converted into TAG 
by the PDAT activity. It is therefore not possible to distinguish whether the 

15 minor amounts of labeled TAG that is formed in the presence of di-oleoyl-PC 
carrying a [ 14 C]acyl group in the sn-1 position, is synthesized directly from the 
s/>1 -labeled PC by a PDAT that also can act on the sn-1 postion, or if it is first 
converted to sn-1 -labeled DAG and then acylated by a PDAT with strict 
selectivity for the transfer of acyl groups at the sn-2 position of PC. Taken 

20 together, this shows that the PDAT encoded by YNR008w catalyses an acyl 
transfer from the sn-2 position of PC to DAG, thus causing the formation of 
TAG and lyso-PC. 

The substrate specificity of yeast PDAT was further analyzed with respect 
25 to the headgroup of the acyl donor, the acyl group transferred and the acyl 
chains of the acceptor DAG molecule. The two major membrane lipids of S. 
cerevisiae are PC and PE, and as shown in Fig. 4B (bars 1 and 2), dioieoyl-PE 
is nearly 4-fold more efficient than dioleoyl-PC as acyl donor in the PDAT- 
catalyzed reaction. Moreover, the rate of acyl transfer is strongly dependent on 
30 the type of acyl group that is transferred. Thus, a ricinoleoyl group at the sn-2 
position of PC is 2.5 times more efficiently transferred into TAG than an oleoyl 
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group in the same position (Fig. 4B bars 1 and 3). In contrast, yeast PDAT has 
no preference for the transfer of vernoloyl groups over oleoyl groups (Fig. 4B 
bars 1 and 4). The acyl chain of the acceptor DAG molecule also affects the 
efficiency of the reaction. Thus, DAG with a ricinoleoyl or a vernoloyl group is a 
5 more efficient acyl acceptor than dioleoyl-DAG (Fig. 4B bars 1 , 5 and 6). Taken 
together, these results clearly show that the efficiency of the PDAT-catalyzed 
acyl transfer is strongly dependent on the properties of the substrate lipids. 

PDAT aenes. Nucleotide and amino acid sequences of several PDAT 
10 genes are given as SEQ ID No. 1 through 15. Futher provisional and/or partial 
sequences are given as SEQ ID NO 1a through 5a and 1b through 11b, 
respectively. One of the Arabidopsis genomic sequences (SEQ ID NO. 4) 
identfied an Arabidopsis EST cDNA clone; T04806. This cDNA clone was fully 
characterised and the nucleotide sequence is given as SEQ ID NO. 5. Based 
is on the sequence homology of the T04806 cDNA and the Arabidopsis thaiiana 
genomic DNA sequence (SEQ ID NO 4) it is apparent that an additional A is 
present at position 417 in the cDNA clone (data not shown). Excluding this 
nucleotide would give the amino acid sequence depicted in SEQ ID NO. 12. 

20 Increased TAG content in seeds of Arabidopsis thaiiana that express the 

veast PDAT. For the expression of the yeast PDAT gene in Arabidopsis 
thaiiana an EcoRI fragment from the pBluescript-PDAT was cloned together 
with napin promotor (25) into the vector pGPTV-KAN (26). A plasmid 
(pGNapPDAT) having the yeast PDAT gene in the correct orientation was 

25 identified and transformed into Agrobacterium tumefaciens. These bacteria 
were used to transform Arabidopsis thaiiana Columbia (C-24) plants using the 
root transformation method (27). Plants transformed with an empty vector were 
used as controls. 

First generation seeds (T1) were harvested and germinated on kanamycin 
30 containing medium. Second generation seeds (T2) were pooled from individual 
plants and their fatty acid contents analysed by quantification of their methyl 
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esthers by gas liquid chromatography after methylation of the seeds with 2% 
sulphuric acid in methanol at 85 °C for 1,5 hours. Quantification was done with 
heptadecanoic acid methyl esters as internal standard. 

From the transformation with pGNapPDAT one T1 plant (26-14) gave raise to 
5 seven T2 plants of which 3 plants yielded seeds with statistically (in a mean 
difference two-sided test) higher oil content than seeds from T2 plants 
generated from T1 plant 32-4 transformed with an empty vector (table 2). 
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Description of Figures 
FIG. 1. 

Metabolism of 14 C-labeled PC into the neutral lipid fraction bv Plant 
5 microsomes. (A) Microsomes from developing seeds of sunflower, R. 
communis and C. palaestina were incubated for 80 min at 30°C with PC (8 
nmol) having oleic acid in its sn-1 position, and either 14 C-labeled oleic, 
ricinoleic or vernolic acid in its sn-2 position. Radioactivity incorporated in TAG 
(open bars), DAG (solid bars), and unsterified fatty acids (hatched bars) was 
10 quantified using thin layer chromatography followed by electronic 
autoradiography, and is shown as percentage of added labeled substrate. (B) 
Synthesis in vitro of TAG carrying two vernoloyi and one [ 14 C]ricinoleoyl group 
by microsomes from R. communis. The substrates added were unlabeled 
divernoloyl-DAG (5 nmol), together with either sn-1-oleoyl-sn-2-[ 14 C]ricinoleoyl- 
15 DAG (0.4 nmol, 7700 dpm/nmol) or sn-1-oleoyl-sn-2-[ 14 C]ricinoleoyl-PC (0.4 
nmol, 7700 dpm/nmol). The microsomes were incubated with the substrates for 
30 min at 30°C, after which samples were removed for lipid analysis as 
described in the section ..general methods". The data shown are the average of 
two experiments. 

20 

FIG. 2. 

PDAT activity in veast microsomes, as visualized bv autoradioaram of neutral 
lipid products separated on TLC . Microsomal membranes (10 nmol of PC) from 
the wild type yeast strain FY1679 (lanes 1-3), a congenic yeast strain 

25 (FVKT004-04C(AL)) that is disrupted for YNR008w (lane 4) or the same 
disruption strain transformed with the plasmid pUS1 , containing the YNR008w 
gene behind its native promotor (lane 5), were assayed for PDAT activity. As 
substrates, we used 2 nmol sn-1-oleoyl-sn-2-[ 14 C]ricinoleoyl-PC together with 
either 5 nmol of dioleoyl-DAG (lanes 2, 4 and 5) or raooleoyl-vernoleoyl-DAG 

30 (lane 3). The enzymatic assay and lipid analysis was performed as described in 
Materials and Methods. The cells were precultured for 20 h in liquid YPD 
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medium, harvested and re-suspended in an equal volume of minimal medium 
(19) containing 16 g/l glycerol. The cells were then grown for an additional 24 h 
prior to being harvested. Selection for the plasmid was maintained by growing 
the transformed cells in synthetic medium lacking uracil (18). Abbreviations: 1- 
5 OH-TAG, monoricinoleoyl-TAG; 1 -OH-1 -ep-TAG, monoricinoleoyl- 
monovemoloyl-TAG; OH-FA, unesterified ricinoleic acid. 

Fig. 3. 

Lipid content (A.B^ and PDAT activity (C) in PDAT overexpressino veast cells. 

io The PDAT gene in the plasmid pUS4 was overexpressed from the galactose- 
induced GAL1-TPK2 promotor in the wild type strain W303-1A (7). Its 
expression was induced after (A) 2 hours or (B) 25 hours of growth by the 
addition of 2% final concentration (w/v) of galactose. The cells were then 
incubated for another 22 hours before being harvested. The amount of lipids of 

15 the harvested cells was determined by GLC-analysis of its fatty acid contents 
and is presented as pmol fatty acids per mg dry weight in either TAG (open 
bar), polar lipids (hatched bar), sterol esters (solid bar) and other lipids (striped 
bar). The data shown are the mean values of results with three independent 
yeast cultures. (C) In vitro synthesis of TAG by microsomes prepared from 

20 yeast cells containing either the empty vector (vector) or the PDAT plasmid (+ 
PDAT). The cells were grown as in Fig. 3A. The substrate lipids dioleoyl-DAG 
(2.5 nmoi) and s/>1-oleoyl-sn-2-[ 14 C]-oleoyl-PC (2 nmol) were added to aliquots 
of microsomes (10 nmol PC), which were then incubated for 10 min at 28 °C. 
The amount of label incorporated into TAG was quantified by electronic 

25 autoradiography. The results shown are the mean values of two experiments. 

FIG. 4. 

Substrate specificity of veast PDAT. The PDAT activity was assayed by 
incubating aliquots of lyophilized microsomes (10 nmol PC) with substrate lipids 
30 at 30°C for 10 min (panel A) or 90 min (panel B). Unlabeled DAG (2.5 nmol) 
was used as substrates together with different labeled phospholipids, as shown 
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in the figure. (A) Sn-position specificity of yeast PDAT regarding the acyl donor 
substrate. Dioleoyl-DAG together with either sn-1-[ 14 C]oleoyl-sn-2-[ 14 C]oleoyl- 
PC (di-[ 14 C]-PC), sn-1-[ 14 C]oleoyl-sn-2-oleoyl-PC (sn1-[ 14 C]-PC) or sn-1-oleoyl- 
sn-2-[ 14 C]oleoyl-PC (sn2-[ 14 C]-PC). (B) Specificity of yeast PDAT regarding 

5 phospholipid headgroup and of the acyl composition of the phospholipid as well 
as of the diacylglycerol. Dioleoyl-DAG together with either sn-1 -oleoyl-sn-2- 
[ 14 C]oleoyl-PC (oleoyl-PC), sn-1 -oleoyl-sn-2-[ 14 C]oleoyl-PE (oleoyl-PE), sn-1- 
oleoyl-sn-2-[ 14 C]ricinoleoyl-PC (ricinoleoyl-PC) or s/>1 -oleoyl-sn-2- 
[ 14 C]vemoloyl-PC (vernoloyl-PC). In the experiments presented in the 2 bars to 

10 the far right, monoricinoleoyl-DAG (ricinoleoyl-DAG or mono-vemoloyl-DAG 
(vemoloyl-DAG) were used together with sn-1-oleoyl-sn-2-[ 14 C]-oleoyl-PC. The 
label that was incorporated into TAG (solid bars) and lyso-PC (LPC, open bars) 
was quantified by electronic autoradiography. The results shown are the mean 
values of two experiments. The microsomes used were from W303-1A cells 

is overexpressing the PDAT gene from the GAL1-TPK2 promotor, as described in 
Fig. 3. The expression was induced at early stationary phase and the cells were 
harvested after an additional 24 h. 

20 TAB.1: 

In vitro synthesis of triacvlQlvcerols in microsomal preparations of developing 
castor bean. Aliqouts of microsomes (20 nmol PC) were lyophilised and 
substrate lipids were added in benzene solution: (A) 0.4 nmol [ 14 C]-DAG (7760 
dpm/nmol) and where indicated 1 .6 nmol unlabelled DAG; (B) 0.4 nmol [ 14 C]- 

25 DAG (7760 dpm/nmol) and 5 nmol unlabelled di-ricinoleoyl-PC and (C) 0.25 
nmol [ 14 C]-PC (4000 dpm/nmol) and 5 nmol unlabelled DAG. The benzene was 
evaporated by N 2 and 0.1 ml of 50 mM potassium phosphate was added, 
thoroughly mixed and incubated at 30 °C for (A) 20 min.; (B) and (C) 30 min.. 
Assays were terminated by extraction of the lipids in chloroform. The lipids 

30 were then separated by thin layer chromatography on silica gel 60 plates 
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(Merck; Darmstadt, Germany) in hexan/diethylether/acetic 35:70:1.5. The 
radioactive lipids were visualised and the radioactivity quantified on the plate by 
electronic autoradiography (Instant Imager, Packard, US). Results are 
presented as mean values of two experiments. 

5 

Radioactivity in different triacylglycerols (TAG) species formed. Abbreviations 
used: 1-OH-, mono-ricinoleoyl-; 2-OH, di-ricinoleoyl-; 3-OH-, triricinoleoyl; 1- 
OH-1-ver-, mono-ricinoleoly-monovernoleoyl-; 1-OH-2-ver-, mono-ricinoleoyl- 
divernoleoyl-. Radiolabeled DAG and PC were prepared enzymatically. The 
10 radiolabeled ricinoleoyl group is attached at the sn-2-position of the lipid and 
unlabelied oleoyl group at the sn-1 -position. Unlabelled DAG with vemoleoyl- or 
ricinoleoyl chains were prepared by the action of TAG lipase (6) on oil of 
Euphorbia lagascae or Castor bean, respectively. Synthetic di-ricinoleoyl-PC 
was kindly provided from Metapontum Agribios (Italy). 

15 

TAB.2: 

Total fatty acids per mg of T2 seeds pooled from individual Arabidopsis thaliana 
plants transformed with yeast PDAT gene under the control of napin promotor 
20 (26-14) or transformed with empty vector (32-4). 

* = stastistical difference between control plants and PDAT transformed plants 
in a mean difference two-sided test at a = 5. 
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Description of the SEQ ID: 

SEQ ID NO. 1: Genomic DNA sequence and suggested amino acid sequence of 
the Saccharomyces cerevisiae PDAT gene, YNR008w, with GenBank accession 
5 number Z71 623 and Y1 31 39, and with nucleotide ID number 1 302481 . 

SEQ ID NO. 2. The amino acid sequence of the suggested open reading frame 
YNROOSw from Saccharomyces cerevisiae. 

10 SEQ ID NO. 3r. Genomic DNA sequence of the Schizosaccharomyces pombe gene 
SPBC776.14. 

SEQ ID NO. 4: Genomic DNA sequence of part of the Arabidopsis thaliana locus 
with GenBank accession number AB006704. 

15 

SEQ ID NO. 5: Nucleotide sequence of the Arabidopsis thaliana cDNA clone with 
GenBank accession number T04806, and nucleotide ID number 315966. 

SEQ ID NO. 6: Predicted amino acid sequence of the Arabidopsis thaliana cDNA 
20 clone with GenBank accession number T04806. 

SEQ ID NO. 7: Nucleotide and amino acid sequence of the Zea mays EST clone 
with GenBank accession number AI491339, and nucleotide ID number 4388167. 

25 SEQ ID NO. 8: Predicted amino acid sequence of the Zea mays EST clone with 
GenBank accession number AI491339, and nucleotide ID number 4388167. 

SEQ ID NO. 9: DNA sequence of part of the Neurospora crassa EST clone 
W07G1, with GenBank accession number AI398644, and nucleotide ID number 
30 4241729. 
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SEQ ID NO. 10: Genomic DNA sequence of part of the Arabidopsis thaliana locus 
with GenBank accession number AC004557. 

SEQ ID NO. 1 1: Genomic DNA sequence of part of the Arabidopsis thaliana locus 
5 with GenBank accession number AC003027. 

SEQ ID NO. 12: DNA sequencce of part of the Lycopersicon esculentum cDNA 
clone with GenBank accession number AI486635. 

10 SEQ ID NO. 13: Amino acid sequence of the Schizosaccharomyces pombe 
putative open reading frame CAA22887 of the Schizosaccharomyces pombe gene 
SPBC776.14. 

SEQ ID NO. 14: Amino acid sequence of the Arabidopsis thaliana putative open 
15 reading frame AAC80628 derived from the Arabidopsis thaliana locus with 
GenBank accession number AC004557. 

SEQ ID NO 15: Amino acid sequence of the Arabidopsis thaliana putative open 
reading frame AAD10668 derived from the Arabidopsis thaliana locus with 
20 GenBank accession number AC003027. 

Further provisional and/or partial sequences are defined through the following 
SEQ IDs: 

25 SEQ ID NO. 1a: The amino acid sequence of the yeast ORF YNR008w from 
Saccharomyces cerevisiae. 

SEQ ID NO. 2a: Amino acid sequence of the region of the Arabidopsis thaliana 
genomic sequence (AC004557). 

30 
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SEQ ID NO. 3a: Amino acid sequence of the region of the Arabidopsis thaliana 
genomic sequence (AB006704). 

SEQ ID NO. 4a: The corresponding genomic DNA sequence and amino acid 
5 sequence of the yeast ORF YNR008w from Saccharomyces cerevisiae. 

SEQ ID NO. 5a: The amino acid sequence of the yeast ORF YNR008w from 
Saccharomyces cerevisiae derived form the corresponding genomic DNA 
sequence. 

10 

SEQ ID NO. 1b: Genomic DNA sequence of the Saccharomyces cerevisiae 
PDAT gene, YNR008w, genebank nucleotide ID number 1302481, and the 
suggested YNR008w amino acid sequence. 

15 

SEQ ID NO. 2b: The suggested amino acid sequence of the yeast gene 
YNR008w from Saccharomyces cerevisiae. 

SEQ ID NO. 3b: Genomic DNA sequence of the Schizosaccharomyces pombe 
20 gene SPBC776.14. 

SEQ ID NO. 4b: Genomic DNA sequence of part of the Arabidopsis thaliana 
locus with genebank accession number AB006704. 

25 SEQ ID NO. 5b: Nucleotide sequence and the corresponding amino acid 
sequence of the Arabidopsis thaliana EST-clone with genebank accession 
number T04806, and ID number 315966. 

SEQ ID NO. 6b: Nucleotide and amino acid sequence of the Zea mays cDNA 
30 clone with genebank ID number 43881 67. 
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SEQ ID NO. 7b: Amino acid sequence of the Zea mays cDNA clone with 
genebank ID number 4388167. 

SEQ ID NO. 8b: DNA sequence of part of the Neurospora crassa cDNA clone 
5 W07G1 , ID number 4241 729. 

SEQ ID NO. 9b: Genomic DNA sequence of part of the Arabidopsis thaliana 
locus with genebank accession number AC004557. 

10 SEQ ID NO. 10b: Genomic DNA sequence of part of the Arabidopsis thaliana 
locus with genebank accession number AC003027. 

SEQ ID NO. 1 1b: DNA sequence of part of the Lycopersicon esculentum cDNA 
clone with genebank accession number AI486635. 

15 
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Claims 



1 . An enzyme catalysing in an acyl-CoA-independent reaction the transfer of 
fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway 

5 for the production of triacylglycerol. 

2. An enzyme according to claim 1, comprising an amino acid sequence as 
set forth in SEQ ID No. 2 or a functional fragment, derivate, allele, homolog 
or isoenzyme thereof. 

10 

3. An enzyme according to claims 1 or 2 designated as 
phospholipidrdiacylglycerol acyltransf erase (PDAT). 

4. An enzyme according to claims 1 to 3, comprising an amino acid sequence 
is as set forth in SEQ ID No. 1a, 2b or 5a or a functional fragment, derivate, 

allele, homolog or isoenzyme thereof. 

5. An enzyme according to claims 1 to 4, comprising an amino acid sequence 
selected from the group consisting of sequences as set forth in SEQ ID No. 

20 2a, 3a, 5b, 6, 7b, 8, 13, 14 or 15 or a functional fragment, derivate, allele, 
homolog or isoenzyme thereof. 

6. An enzyme according to claims 1 to 5, comprising an amino acid sequence 
encoded through a nucleotide sequence, a portion, derivate, allele or 

25 homolog thereof selected from the group consisting of sequences as set 
forth in SEQ ID No. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 9, 9b, 10, 10b, 
11, 11b or 12 or a functional fragment, derivate, allele, homolog or 
isoenzyme of the enzyme encoding amino acid sequence. 

30 7. A nucleotide sequence encoding an enzyme catalysing in an acyl-CoA- 
independent reaction the transfer of fatty acids from phospholipids to 
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diacylglycerol in the biosynthetic pathway for the production of 
triacylglycerol. 

8. A nucleotide sequence according to claim 7 encoding an enzyme 
5 designated as phospholipid:diacylglycerol acyltransferase (PDAT). 

9. A nucleotide sequence according to claims 7 or 8, selected from the group 
consisting of sequences as set forth in SEQ ID No. 1, 1b, 3, 3b, 4, 4a, 4b, 
9b, 10, 10b or 1 1 or a portion, derivate, allele or homolog thereof. 

10 

10. A partial nucleotide sequence corresponding to a fullength nucleotide 
sequence according to claims 7 to 9, selected from the group consisting of 
sequences as set forth in SEQ ID No. 5, 5b, 6b, 7, 8b, 9, 11b or 12 or a 
portion, derivate, allele or homolog thereof. 

15 

1 1 . A nucleotide sequence according to claims 7 to 10, comprising a nucleotide 
sequence which is at least 40% homologous to a nucleotide sequence 
selected form the group consisting of those sequences set forth in SEQ ID 
No. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 9. 9b, 10, 10b, 11, 11b or 12. 

20 

12. A gene construct comprising a nucleotide sequence according to claims 7 
to 1 1 operably linked to a heterologous nucleic acid. 

13. A vector comprising a nucleotide sequence according to claims 7 to 1 1 or a 
25 gene construct according to claim 12. 

14. A vector according to claim 13, which is an expression vector. 

15. A vector according to claims 13 or 14, further comprising a selectable 
30 marker gene and/or nucleotide sequences for the replication in a host cell 

or the integration into the genome of the host cell. 
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16. A transgenic cell or organism containing a nucleotide sequence according 
to claims 7 to 11 and/or a gene construct according to claim 12 and/or a 
vector according to claims 13 to 15. 

17. A transgenic cell or organism according to claim 16 which is an eucaryotic 
cell or organism. 

18. A transgenic cell or organism according to claims 16 or 17 which is a yeast 
cell or a plant cell or a plant. 

19. A transgenic cell or organism according to claims 16 to 18 having an 
altered biosynthetic pathway for the production of triacylglycerol. 

20. A transgenic cell or organism according to claims 16 to 19 having an 
altered oil content. 

21. A transgenic cell or organism according to claims 16 to 20 wherein the 
activity of PDAT is altered. 

22. A transgenic cell or organism according to claims 16 to 21 wherein the 
altered activity of PDAT is characterized by an alteration in gene 
expression, catalytic activity and/or regulation of activity of the enzyme. 

23. A transgenic cell or organism according to claims 16 to 22 wherein the 
altered biosynthetic pathway for the production of triacylglycerol is 
characterized by the prevention of accumulation of undesirable fatty acids 
in the membrane lipids. 

24. A process for the production of triacylglycerol, comprising growing a 
transgenic cell or organism according to claims 16 to 23 under conditions 
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whereby the said nucleotide sequence according to claims 7 to 11 is 
expressed. 

25. Triacylglycerols produced by a process according to claim 24. 

5 

26. Use of a nucleotide sequence according to claims 7 to 11 and/or an 
enzyme according to claims 1 to 6 for the production of triacylglycerol 
and/or triacylglycerols with uncommon fatty acids. 

10 27. Use of a nucleotide sequence according to claims 7 to 11 and/or an 
enzyme according to claims 1 to 6 for the transformation of any cell or 
organism in order to be expressed in this cell or organism and result in an 
altered, preferably increased oil content of this cell or organism. 

15 



SUB STITUTE SHEET (RULE 26) 



WO 00/60095 - 



1/6- 



-PCT/EPOO/02701 



Fig. 1: 
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Lipid synthesis (pmol/min) 
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T2 plant number nmol fatty acids per mg seed 



32-4 



26-14 



standard 



1 


1 277 


+11 (n=2) 


4 


1261 


+63 (11=3^ 


5 


1369 


+17 Cn=3^ 


6 


1312 


TJJ ^lA 1 11 1 / 


7 


1197 




8 


1240 








+54 (n= 5) 


10 


1381 


+35 (n=5) 


1 


1444 


+110 (n=4) 


2 


1617* 


+109 (n=4) 


3 


1374 


+37 (n=2) 


5 


1562* 


+70 (n=4) 


6 


1393 


+77 (n=4) 


7 


1433 


+98 (n=4) 


8 


1581* 


+82 (n=4) 
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Sequence Listing 



<210> 1 

<211> 1986 

<212> genomic DNA 

<213> Saccharomyces cerevisiae 



<221> CDS 

<222> (1) . . (1983) 



ate agt ggc agt gca aaa aga aat gag cgt ggc aaa gat ttc gac agg 
lie Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg 

50 - 55 . 60 

aaa aga gac ggg aac ggt aga aaa cgt tgg aga gat tec aga aga ctg 
Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 
65 70 75 80 

att ttc att ctt ggt gca ttc tta ggt gta ctt ttg ccg ttt age ttt 
He Phe He Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 
85 90 95 

ggc get tat cat gtt cat aac age gat age gac ttg ttt gac aac ttt 
Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe 
100 105 110 



48 



96 



<400> 1 

atg ggc aca ctg ttt cga aga aat gtc cag aac caa aag agt gat tct 

Met Gly Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 

1 5 10 15 

gat gaa aac aat aaa ggg ggt tct gtt cat aac aag cga gag age aga 
Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 
20 25 30 

aac cac att cat cat caa cag gga tta ggc cat aag aga aga agg ggt 144 
Asn His He His His Gin Gin Gly Leu Gly His Lys Arg Arg Arg Gly 
35 40 45 



192 



240 



288 



336 



gta aat ttt gat tea ctt aaa gtg tat ttg gat gat tgg aaa gat gtt 
Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val 
115 120 125 

etc cca caa ggt ata agt teg ttt att gat gat att cag get ggt aac 
Leu Pro Gin Gly He Ser Ser Phe He Asp Asp He Gin Ala Gly Asn 
130 135 140 
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tac tec aca tct tct tta gat gat etc age gaa aa*t ttt gec gtt ggt 480 
Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
145 150 155 160 

aaa caa etc tta cgt gat tat aat ate gag gee aaa cat cct gtt gta 528 
Lys Gin Leu Leu Arg Asp Tyr Asn lie Glu Ala Lys His Pro Val Val 
165 170 175 

atg gtt cct ggt gtc att tct acg gga att gaa age tgg gga gtt att 57 6 
Met Val Pro Gly Val He Ser Thr Gly He Glu Ser Trp Gly Val He 
180 185 190 

gga gac gat gag tgc gat agt tct gcg cat ttt cgt aaa egg ctg tgg 624 
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 
195 200 205 

gga agt ttt tac atg ctg aga aca atg gtt atg gat aaa gtt tgt tgg 672 
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp 
210 215 220 

ttg aaa cat gta atg tta gat cct gaa aca ggt ctg gac cca ccg aac 720 
Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn 
225 230 235 240 

ttt acg eta cgt gca gca cag ggc ttc gaa tea act gat tat ttc ate 7 68 
Phe Thr Leu Arg Ala Ala Gin Gly Phe Glu Ser Thr Asp Tyr Phe He 
245 250 255 

gca ggg tat tgg att tgg aac aaa gtt ttc caa aat ctg gga gta att 816 
Ala Gly Tyr Trp He Trp Asn Lys Val Phe Gin Asn Leu Gly Val He 
260 265 270 

ggc tat gaa ccc aat aaa atg acg agt get gcg tat gat tgg agg ctt 864 
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 280 285 



912 



gca tat tta gat eta gaa aga cgc gat agg tac ttt acg aag eta aag 

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 
290 295 300 

gaa caa ate gaa ctg ttt cat caa ttg agt ggt gaa aaa gtt tgt tta 960 

Glu Gin He Glu Leu Phe His Gin Leu Ser Gly Glu Lys Val Cys Leu 
305 310 315 320 

att gga cat tct atg ggt tct cag att ate ttt tac ttt atg aaa tgg 1008 

He Gly His Ser Met Gly Ser Gin lie lie Phe Tyr Phe Met Lys Trp 
325 330 335 
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gtc gag gcc gaa ggc cct ctt tac ggt aat ggt g£t cgt ggc tgg get 1056 
Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val 
340 345 350 



aac gaa cac ata gat tea 
Asn Glu His lie Asp Ser 
355 

get cca aag gca gtt cca 
Ala Pro Lys Ala Val Pro 
370 

att caa tta aat acg tta 
lie Gin Leu Asn Thr Leu 
385 390 



ttc att aat gca gca ggg 
Phe lie Asn Ala Ala Gly 
360 

get eta att agt ggt gaa 
Ala Leu lie Ser Gly Glu 
375 380 

gcc atg tat ggt ttg gaa 
Ala Met Tyr Gly Leu Glu 
395 



acg ctt ctg ggc 1104 

Thr Leu Leu Gly 

365 

atg aaa gat acc 1152 
Met Lys Asp Thr 



aag ttc ttc tea 1200 
Lys Phe Phe Ser 
400 



aga att gag aga gta aaa atg tta caa acg tgg ggt ggt ata cca tea 1248 
Arg lie Glu Arg Val Lys Met Leu Gin Thr Trp Gly Gly He Pro Ser 
405 410 415 

atg eta cca aag gga gaa gag gtc att tgg ggg gat atg aag tea tct 1296 
Met Leu Pro Lys Gly Glu Glu Val He Trp Gly Asp Met Lys Ser Ser 
420 425 430 



tea gag gat gca ttg aat aac aac act gac aca tac ggc aat ttc att 1344 
Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe He 
435 440 445 



cga ttt gaa agg aat acg 
Arg Phe Glu Arg Asn Thr 
450 

aaa gac gcc att aac atg 
Lys Asp Ala He Asn Met 
465 470 



age gat get ttc aac aaa 
Ser Asp Ala Phe Asn Lys 
455 460 

aca tta teg ata tea cct 
Thr Leu Ser He Ser Pro 
475 



aat ttg aca atg 1392 
Asn Leu Thr Met 



gaa tgg etc caa 1440 
Glu Trp Leu Gin 
480 



aga aga gta cat gag cag tac teg ttc ggc tat tec aag aat gaa gaa 1488 
Arg Arg Val His Glu Gin Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 
485 490 495 



gag tta aga aaa aat gag eta cac cac aag cac tgg teg aat cca atg 15 3 6 

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met 
500 505 510 

gaa gta cca ctt cca gaa get ccc cac atg aaa ate tat tgt ata tac 1584 

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys He Tyr Cys He Tyr 

515 520 525 
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ggg gtg aac aac cca act gaa agg gca tat gta tat aag gaa gag gat 1632 

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 
530 535 540 



gac tec tct get ctg aat ttg acc ate gac tac gaa age aag caa cct 
Asp Ser Ser Ala Leu Asn Leu Thr lie Asp Tyr Glu Ser Lys Gin Pro 
545 550 555 560 

gta ttc etc acc gag ggg gac gga acc gtt ccg etc gtg gcg cat tea 
Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser 
565 570 575 

atg tgt cac aaa tgg gec cag ggt get tea ccg tac aac cct gec gga 
Met Cys His Lys Trp Ala Gin Gly Ala Ser Pro Tyr Asn Pro Ala Gly 
580 585 590 

att aac gtt act att gtg gaa atg aaa cac cag cca gat cga ttt gat 
lie Asn Val Thr lie Val Glu Met Lys His Gin Pro Asp Arg Phe Asp 
595 600 605 

ata cgt ggt gga gca aaa age gee gaa cac gta gac ate etc ggc age 
He Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp He Leu Gly Ser 
610 615 620 

gcg gag ttg aac gat tac ate ttg aaa att gca age ggt aat ggc gat 
Ala Glu Leu Asn Asp Tyr He Leu Lys He Ala Ser Gly Asn Gly Asp 
625 630 635 640 

etc gtc gag cca cgc caa ttg tct aat ttg age cag tgg gtt tct cag 
Leu Val Glu Pro Arg Gin Leu Ser Asn Leu Ser Gin Trp Val Ser Gin 
645 650 655 

atg ccc ttc cca atg taa 
Met Pro Phe Pro Met 
660 
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<210> 2 
<211> 661 
<212> PRT 

<213> Saccharontyces cerevisiae 
<400> 2 

Met Gly Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 

15 10 15 

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 

20 25 30 

Asn His lie His His Gin Gin Gly Leu Gly His Lys Arg Arg Arg Gly 

35 40 45 

lie Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg 

50 55 60 

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 
65 70 75 80 

lie Phe lie Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 

85 90 95 

Gly Ala Tyr Kis Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe 

100 105 110 

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val 

115 120 125 

Leu Pro Gin Gly lie Ser Ser Phe lie Asp Asp lie Gin Ala Gly Asn 

130 135 140 

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
145 150 155 160 

Lys Gin Leu Leu Arg Asp Tyr Asn lie Glu Ala Lys His Pro Val Val 

165 170 175 

Met Val Pro Gly Val lie Ser Thr Gly lie Glu Ser Trp Gly Val lie 

180 185 190 

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 

195 200 205 

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp 

210 215 220 

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn 
225 230 235 240 

Phe Thr Leu Arg Ala Ala Gin Gly Phe Glu Ser Thr Asp Tyr Phe lie 

245 250 255 

Ala Gly Tyr Trp lie Trp Asn Lys Val Phe Gin Asn Leu Gly Val lie 

260 265 270 

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 

275 280 285 

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 

290 295 300 

Glu Gin He Glu Leu Phe His Gin Leu Ser Gly Glu Lys Val Cys Leu 
305 310 315 320 

He Gly His Ser Met Gly Ser Gin He He Phe Tyr Phe Met Lys Trp 
325 330 335 
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Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val 

340 345 350 

Asn Glu His lie Asp Ser Phe He Asn Ala Ala Gly Thr Leu Leu Gly 

355 360 365 

Ala Pro Lys Ala Val Pro Ala Leu He Ser Gly Glu Met Lys Asp Thr 

370 375 380 

He Gin Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser 
385 390 395 400 

Arg He Glu Arg Val Lys Met Leu Gin Thr Trp Gly Gly He Pro Ser 

405 410 415 

Met Leu Pro Lys Gly Glu Glu Val He Trp Gly Asp Met Lys Ser Ser 

420 425 430 

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe He 

435 440 445 

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met 

450 455 460 

Lys Asp Ala He Asn Met Thr Leu Ser He Ser Pro Glu Trp Leu Gin 
465 470 475 480 

Arg Arg Val His Glu Gin Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 

485 490 495 

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met 

500 505 510 

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys He Tyr cys He Tyr 

515 520 525 

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 

530 535 540 

Asp Ser Ser Ala Leu Asn Leu Thr lie Asp Tyr Glu Ser Lys Gin Pro 
545 550 555 560 

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser 

565 570 575 

Met Cys His Lys Trp Ala Gin Gly Ala Ser Pro Tyr Asn Pro Ala Gly 

580 585 590 

He Asn Val Thr He Val Glu Met Lys His Gin Pro Asp Arg Phe Asp 

595 600 605 

He Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp He Leu Gly Ser 

610 615 620 

Ala Glu Leu Asn Asp Tyr He Leu Lys He Ala Ser Gly Asn Gly Asp 
625 630 635 640 

Leu Val Glu Pro Arg Gin Leu Ser Asn Leu Ser Gin Trp Val Ser Gin 

645 650 655 

Met Pro Phe Pro Met 
660 
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<210> 3 ' 

<211> 2312 

<212> genomic DNA 

<213> Schizosaccharomyces pombe 



<4G0> 3 

ATGGCGTCTT CCAAGAAGAG CAAAACTCAT AAGAAAAAGA AAGAAGTCAA 50 

ATCTCCTATC GACTTACCAA ATTCAAAGAA ACCAACTCGC GCTTTGAGTG 100 

AGCAACCTTC AGCGTCCGAA ACACAATCTG TTTCAAATAA ATCAAGAAAA 150 

TCTAAATTTG GAAAAAGATT GAATTTTATA TTGGGCGCTA TTTTGGGAAT 200 

ATGCGGTGCT TTTTTTTTCG CTGTTGGAGA CGACAATGCT GTTTTCGACC 250 

CTGCTACGTT AGATAAATTT GGGAATATGC TAGGCTCTTC AGACTTGTTT 300 

GATGACATTA AAGGATATTT ATCTTATAAT GTGTTTAAGG ATGCACCTTT 350 

TACTACGGAC AAGCCTTCGC AGTCTCCTAG CGGAAATGAA GTTCAAGTTG 400 

GTCTTGATAT GTACAATGAG GGATATCGAA GTGACCATCC TGTTATTATG 450 

GTTCCTGGTG TTATCAGCTC AGGATTAGAA AGTTGGTCGT TTAATAATTG 500 

CTCGATTCCT TACTTTAGGA AACGTCTTTG GGGTAGCTGG TCTATGCTGA 550 

AGGCAATGTT CCTTGACAAG CAATGCTGGC TTGAACATTT AATGCTTGAT 600 

AAAAAAACCG GCTTGGATCC GAAGGGAATT AAGCTGCGAG CAGCTCAGGG 650 

GTTTGAAGCA GCTGATTTTT TTATCACGGG CTATTGGATT TGGAGTAAAG 700 

TAATTGAAAA CCTTGCTGCA ATTGGTTATG AGCCTAATAA CATGTTAAGT 7 50 

GCTTCTTACG ATTGGCGGTT ATCATATGCA AATTTAGAGG AACGTGATAA 800 

ATATTTTTCA AAGTTAAAAA TGTTCATTGA GTACAGCAAC ATTGTACATA 850 

AGAAAAAGGT AGTGTTGATT TCTCACTCCA TGGGTTCACA GGTTACGTAC 900 

TATTTTTTTA AGTGGGTTGA AGCTGAGGGC TACGGAAATG GTGGACCGAC 950 

TTGGGTTAAT GATCATATTG AAGCATTTAT AAATGTGAGT CTCGATGGTT 1000 
GTTTGACTAC GTTTCTAACT TTTGAATAGA TATCGGGATC TTTGATTGGA 1050 

GCACCCAAAA CAGTGGCAGC GCTTTTATCG GGTGAAATGA AAGATACAGG 1100 

TATTGTAATT ACATTAAACA TGTTAATATT TAATTTTTGC TAACCGTTTT 1150 

AAGCTCAATT GAATCAGTTT TCGGTCTATG GGTAAGCAAT AAATTGTTGA 1200 

GATTTGTTAC TAATTTACTG TTTAGTTTGG AAAAATTTTT TTCCCGTTCT 1250 

GAGGTATATT CAAAAATACA AATGTGCTCT ACTTTTTCTA ACTTTTAATA 1300 

GAGAGCCATG ATGGTTCGCA CTATGGGAGG AGTTAGTTCT ATGCTTCCTA 1350 

AAGGAGGCGA TGTTGTATGG GGAAATGCCA GTTGGGTAAG AAATATGTGC 1400 

TGTTAATTTT TTATTAATAT TTAGGCTCCA GATGATCTTA ATCAAACAAA 1450 

TTTTTCCAAT GGTGCAATTA TTCGATATAG AGAAGACATT G AT AAGGAC C 1500 

ACGATGAATT TGACATAGAT GATGCATTAC AATTTTTAAA AAATGTTACA 1550 

GATGACGATT TTAAAGTCAT GCTAGCGAAA AATTATTCCC ACGGTCTTGC 1600 
TTGGACTGAA AAAGAAGTGT TAAAAAATAA CGAAATGCCG TCTAAATGGA 1650 

TAAATCCGCT AGAAGTAAGA ACATTAAAGT TACTAAATTA TACTAACCCA 1700 

AATAGACTAG TCTTCCTTAT GCTCCTGATA TGAAAATTTA TTGCGTTCAC 1750 

GGGGTCGGAA AACCAACTGA GAGAGGTTAT TATTATACTA ATAATCCTGA 1800 

GGGGCAACCT GTCATTGATT CCTCGGTTAA TGATGGAACA AAAGTTGAAA 1850 

ATGTGAGAGA ATTTATGTTT CAAACATTCT ATTAACTGTT TTATTAGGGT 1900 

ATTGTTATGG ATGATGGTGA TGGAACTTTA CCAATATTAG CCCTTGGTTT 1950 

GGTGTGCAAT AAAGTTTGGC AAACAAAAAG GTTTAATCCT GCTAATACAA 2000 

GTATCACAAA TTATGAAATC AAGCATGAAC CTGCTGCGTT TGATCTGAGA 2050 

GGAGGACCTC GCTCGGCAGA ACACGTCGAT AT AC TTGGAC ATTCAGAGCT 2100 

AAATGTATGT TCATTTTACC TTACAAATTT CTATTACTAA CTCTTGAAAT 2150 
AAGGAAATTA TTTTAAAAGT TTCATCAGGC CATGGTGACT CGGTACCAAA 2200 

CCGTTATATA TCAGATATCC AGTACGGACA TAAGTTTTGT AGATTGCAAT 2250 

TAACTAACTA ACCGAACAGG GAAATAATAA ATGAGATAAA TCTCGATAAA 23 00 
CCTAGAAATT AA 2312 
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<210>~4 " 

<211> 3685 

<212> genomic DNA 

<213> Arabidopsis thaliana 



<400> 4 

ATGCCCCTTA TTCATCGGAA AAAGCCGACG GAGAAACCAT CGACGCCGCC 50 

ATCTGAAGAG GTGGTGCACG ATGAGGATTC GCAAAAGAAA C C AC ACGAAT 100 

CTTCCAAATC CCACCATAAG AAATCGAACG GAGGAGGGAA GTGGTCGTGC 150 

ATCGATTCTT GTTGTTGGTT CATTGGGTGT GTGTGTGTAA CCTGGTGGTT 200 

TCTTCTCTTC CTTTACAACG CAATGCCTGC GAGCTTCCCT CAGTATGTAA 250 

CGGAGCGAAT CACGGGTCCT TTGCCTGACC CGCCCGGTGT TAAGCTCAAA 300 

AAAGAAGGTC TTAAGGCGAA ACATCCTGTT GTCTTCATTC CTGGGATTGT 350 

CACCGGTGGG CTCGAGCTTT GGGAAGGCAA AC AATGCGC T GATGGTTTAT 400 

TTAGAAAACG TTTGTGGGGT GGAACTTTTG GTGAAGTCTA CAAAAGGTGA 450 

GCTCAACAAT TCTCACTCTT CCTTTATATT GGGATTTGGA TTGGATCTGA 500 

TGAGATCACG CACTTGTTGC TTCTTCAACA TCACTCAAAC TTTAATTCCA 550 

TGTTTGTCTG TCTTACTCTT TACTTTTTTT TTTTTTTGAT GTGAAACGCT 600 

ATTTTCTTAA GAGACTATTT CTGTATGTGT AAGGTAAGCG TTCCAAGGAC 650 

GTAATTGGCT TGGACTATTT CTGTTTGATT GTTAACTTTA GGATATAAAA 700 

TAGCTGCCTT GGAATTTCAA GTCATCTTAT TGCCAAATCT GTTGCTAGAC 750 

ATGCCCTAGA GTCCGTTCAT AACAAGTTAC TTCCTTTACT GTCGTTGCGT 800 

GTAGATTTAG C TTTGTGT AG CGTATAATGA AGTAGTGTTT TATGTTTTGT 850 

TGGGAATAGA GAAGTTCTAA CTACATCTGT GGAAAGTGTG TTCAGGCTGT 900 

GATAGAGGAC TGTTGCTTTA TTATTCAACT ATGTATATGT GTAATTAAAG 950 

CTAGTTCCTT TTTGATCTTT CAGCTCAATG TGCTTTTCTC AATTTTTTTC 1000 

TCAATTTCAA AGTTTCACAT CGAGTTTATT CACATGTCTT GAATTTCGTC 1050 

CATCCTCGTT CTGTTATCCA GCTTTGAACT CCTCCCGACC CTGCTATGGA 1100 

TATATTAAAA AAAAAGTGTT TTGTGGGTTG CATCTTTGTT ACGATCTGCA 1150 

TCTTCTTCTT TCGGCTCAGT GTTCATGTTT TTGCTATGGT AGAGATGGGC 1200 

AATGTTATTG TTGATGGTAA CAGTGGTATA GTTGATAGTA TCTTAACTAA 1250 

TCAATTATCT CTTTGATTCA GGCCTCTATG TTGGGTGGAA CACATGTCAC 13 00 

TTGACAATGA AACTGGGTTG GATCCAGCTG GTATTAGAGT TCGAGCTGTA 1350 

TCAGGACTCG TGGCTGCTGA CTACTTTGCT CCTGGCTACT TTGTCTGGGC 1400 

AGTGCTGATT GCTAACCTTG CACATATTGG ATATGAAGAG AAAAATATGT 1450 

ACATGGCTGC ATATGACTGG CGGCTTTCGT TTCAGAACAC AGAGGTTCTT 1500 

TTCTCATCGT TCTTTCTATT ATTCTGTTCC ATGTTACGTT TCTTTCTTCA 1550 

TTACTTAAGG CTTAAATATG TTTCATGTTG AATTAATAGG TACGTGATCA 1600 

GACTCTTAGC CGTATGAAAA GTAATATAGA GTTGATGGTT TCTACCAACG 1650 

GTGGAAAAAA AGCAGTTATA GTTCCGCATT CCATGGGGGT CTTGTATTTT 1700 

CTACATTTTA TGAAGTGGGT TGAGGCACCA GCTCCTCTGG GTGGCGGGGG 1750 

TGGGCCAGAT TGGTGTGCAA AGTATATTAA GGCGGTGATG AACATTGGTG 1800 

GACCATTTCT TGGTGTTC C A AAAGC TGTTG CAGGGCTTTT CTCTGCTGAA 1850 

GCAAAGGATG TTGCAGTTGC CAGGTATTGA ATATCTGCTT ATACTTTTGA 1900 

TGATCAGAAC CTTGGCTCTG GAACTCAAAG TTATTCTACT AAATATCAAT 1950 

TCTAATAACA TTGCTATATT ATCGCTGCAA CTGACATTGG TTGATTATTT 2000 

TTGCTGCTTA TGTAACTGAA ACTCTCTTGA GATTAGACAA ATGATGAATT 2050 

GATAATTCTT ACGCATTGCT CTGTGATGAC CAGTTTCTTA GCTTCGACGA 2100 

TAACATTTGT CATACTGTCT TTTGGAGGGC ATTGAATTTT GCTATGGAAA 2150 

GCGCTGGAGC TTCCATGCTT GCATTCTTTA CCAATTAGCG TTATTCTGCT 22 00 

TCTTTCAATT TTCTTGTATA TGCATCTATG GTCTTTTATT TCTTCTTAAT 2250 

TAAAGACTCG TTGGATTAGT TGCTCTATTA GTCACTTGGT TCCTTAATAT 2300 

AGAACTTTAC TTTCTTCGAA AATTGCAGAG CGATTGCCCC AGGATTCTTA 2350 

GACACCGATA TATTTAGACT TCAGACCTTG CAGCATGTAA TGAGAATGAC 2400 

ACGCACATGG GACTCAACAA TGTCTATGTT ACCGAAGGGA GGTGACACGA 2450 

TATGGGGCGG GCTTGATTGG TCACCGGAGA AAGGCCACAC CTGTTGTGGG 2500 

AAAAAGCAAA AG AAC AACGA AACTTGTGGT GAAGCAGGTG AAAACGGAGT 25 50 

TTCCAAGAAA AGTCCTGTTA ACTATGGAAG GATGATATCT TTTGGGAAAG 2 600 

AAGTAGCAGA GGCTGCGCCA TCTGAGATTA ATAATATTGA TTTTCGAGTA 2 650 

AGGACATATA AATCATAATA AACCTTGTAC ATTTTGTGAT TGTATGATGA 2700 

ATATCTGTAC ATTTTATCTG GTGAAGGGTG CTGTCAAAGG TCAGAGTATC 2750 

CCAAATCACA CCTGTCGTGA CGTGTGGACA GAGTACCATG ACATGGGAAT 2 800 

TGCTGGGATC AAAGCTATCG CTGAGTATAA GGTCTACACT GCTGGTGAAG 28 50 
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CTATAGATCT ACT AC AT TAT GTTGCTCCTA AGATGATGGC GCGTGGTGCC 2T30"0™ 

GCTCATTTCT CTTATGGAAT TGCTGATGAT TTGGATGACA CCAAGTATCA 2950 

AGATCCCAAA TACTGGTCAA ATCCGTTAGA GACAAAGTAA GTGATTTCTT 3000 

GATTCCAACT GTATCCTTCG TCCTGATGCA TTATCAGTCT TTTTGTTTTC 3050 

GGTCTTGTTG GATATGGTTT TCAGCTCAAA GCTTACAAAG CTGTTTCTGA 3100 

GCCTTTCTCA AAAAGGCTTG CTCAGTAATA TTGAGGTGCT AAAGTTGATA 3150 

CATGTGACTC TTGCTTATAA ATCCTCCGTT TGGTTTGTTC TGCTTTTTCA 3200 

GATTACCGAA TGCTCCTGAG ATGGAAATCT ACTCATTATA CGGAGTGGGG 3250 
ATACCAACGG AACGAGCATA CGTATACAAG CTTAACCAGT CTCCCGACAG 3300 

TTGCATCCCC TTTCAGATAT TCACTTCTGC TCACGAGGAG GACGAAGATA 3350 

GCTGTCTGAA AGCAGGAGTT TACAATGTGG ATGGGGATGA AACAGTACCC 3400 

GTCCTAAGTG CCGGGTACAT GTGTGCAAAA GCGTGGCGTG GCAAGACAAG 3450 

ATTCAACCCT TCCGGAATCA AGACTTATAT AAGAGAATAC AATCACTCTC 3500 

CGCCGGCTAA CCTGTTGGAA GGGCGCGGGA CGCAGAGTGG TGCCCATGTT 3550 

GATATCATGG GAAACTTTGC TTTGATCGAA GATATCATGA GGGTTGCCGC 3600 

CGGAGGTAAC GGGTCTGATA TAGGACATGA CCAGGTCCAC TCTGGCATAT 3 650 
TTGAATGGTC GGAGCGTATT GACCTGAAGC TGTGA 3 685 
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<210> 5 
<211> 2427 
<212> cDNA 

<213> Arabidopsis thaliana 



<400> 5 

AGAAACAGCT CTTTGTCTCT CTCGACTGAT CTAACAATCC CTAATCTGTG 50 

TTCTAAATTC CTGGACGAGA TTTGACAAAG TCCGTATAGC TTAACCTGGT 100 

TTAATTTCAA GTGACAGATA TGCCCCTTAT TCATCGGAAA AAGCCGACGG 150 

AGAAACCATC GACGCCGCCA TCTGAAGAGG TGGTGCACGA TGAGGATTCG 200 

CAAAAGAAAC CACACGAATC TTCCAAATCC CACCATAAGA AATCGAACGG 250 

AGGAGGGAAG TGGTCGTGCA TCGATTCTTG TTGTTGGTTC ATTGGGTGTG 300 

TGTGTGTAAC CTGGTGGTTT CTTCTCTTCC TTTACAACGC AATGCCTGCG 350 

AGCTTCCCTC AGTATGTAAC GGAGCGAATC ACGGGTCCTT TGCCTGACCC 400 
GCCCGGTGTT AAGCTCAAAA AAAGAAGGTC TTAAGGCGAA ACATCCTGTT 450 

GTCTTCATTC CTGGGATTGT CACCGGTGGG CTCGAGCTTT GGGAAGGCAA 500 

ACAATGCGCT GATGGTTTAT TTAGAAAACG TTTGTGGGGT GGAACTTTTG 550 

GTGAAGTCTA CAAAAGGCCT CTATGTTGGG TGGAACACAT GTCACTTGAC 600 

AATGAAACTG GGTTGGATCC AGCTGGTATT AGAGTTCGAG CTGTATCAGG 650 

ACTCGTGGCT GCTGACTACT TTGCTCCTGG CTACTTTGTC TGGGCAGTGC 700 

TGATTGCTAA CCTTGCACAT ATTGGATATG AAGAGAAAAA TATGTACATG 750 

GCTGCATATG ACTGGCGGCT TTCGTTTCAG AACACAGAGG TACGTGATCA 800 

GACTCTTAGC CGTATGAAAA GTAATATAGA GTTGATGGTT TCTACCAACG 850 

GTGGAAAAAA AGCAGTTATA GTTCCGCATT CCATGGGGGT CTTGTATTTT 900 

CTACATTTTA TGAAGTGGGT TGAGGCACCA GCTCCTCTGG GTGGCGGGGG 950 
TGGGCCAGAT TGGTGTGCAA AGTATATTAA GGCGGTGATG AACATTGGTG 1000 

GACCATTTCT TGGTGTTCCA AAAGCTGTTG CAGGGCTTTT CTCTGCTGAA 1050 

GCAAAGGATG TTGCAGTTGC C AGAGCG ATT GCCCCAGGAT TCTTAGACAC 1100 

CGATATATTT AGACTTCAGA CCTTGCAGCA TGTAATGAGA ATGACACGCA 1150 

CATGGGACTC AACAATGTCT ATGTT AC CGA AGGGAGGTGA CACGATATGG 1200 

GGCGGGCTTG ATTGGTCACC GGAGAAAGGC CACACCTGTT GTGGGAAAAA 1250 

GCAAAAGAAC AACGAAACTT GTGGTGAAGC AGGTGAAAAC GGAGTTTCCA 1300 

AGAAAAGTCC TGTTAACTAT GGAAGGATGA TATCTTTTGG GAAAGAAGTA 1350 

GCAGAGGCTG CGCCATCTGA GATTAATAAT ATTGATTTTC GAGGTGCTGT 1400 

CAAAGGTCAG AGTATCCCAA ATCACACCTG TCGTGACGTG TGGACAGAGT 1450 

ACCATGACAT GGGAATTGCT GGGATCAAAG CTATCGCTGA GTATAAGGTC 1500 

TACACTGCTG GTGAAGCTAT AGATCTACTA CATTATGTTG CTCCTAAGAT 1550 

GATGGCGCGT GGTGCCGCTC ATTTCTCTTA TGGAATTGCT GATGATTTGG 1600 

ATGACACCAA GTATCAAGAT CCCAAATACT GGTCAAATCC GTTAGAGACA 1650 
AAATT AC CGA ATGCTCCTGA GATGGAAATC TACTCATTAT ACGGAGTGGG 1700 
GATACCAACG GAACGAGCAT ACGTATACAA GCTTAACCAG TCTCCCGACA 1750 
GTTGCATCCC CTTTCAGATA TTCACTTCTG CTCACGAGGA GGACGAAGAT 1800 
AGCTGTCTGA AAGCAGGAGT TTACAATGTG GATGGGGATG AAACAGTACC 1850 
CGTCCTAAGT GCCGGGTACA TGTGTGCAAA AGCGTGGCGT GGCAAGACAA 1900 
GATTCAACCC TTCCGGAATC AAGACTTATA TAAGAGAATA CAATCACTCT 1950 
CCGCCGGCTA ACCTGTTGGA AGGGCGCGGG ACGCAGAGTG GTGCCCATGT 2000 
TGATATCATG GGAAACTTTG CTTTGATCGA AGATATCATG AGGGTTGCCG 2050 
CCGGAGGTAA CGGGTCTGAT AT AGGAC ATG ACCAGGTCCA CTCTGGCATA 2100 
TTTGAATGGT CGGAGCGTAT TGACCTGAAG CTGTGAATAT CkTGATCTCT 2150 
TTAAGCTGTC CTGTCAGCTT ATGTGAATCC AATACTTTGA AAGAGAGATC 2200 
ATCATCAATT CATCATCATC GTCATCATCA TGATGCTCAA CTCACAAAGA 2250 
AGCCTGAGAA TGATACTTTG GTgCGAAATT CTCAATACCT CTTTAATATT 2300 
CTTATTGAAT GT AAATT ATA CAATCCTATC TAATGTTTGA ACGATAACAC 2350 
AAAACTTGCT GCNGCCATGT TTGTTTGTCT TGTCAAAAGC ATCAATTTGT 2400 
GGGTTAAAAA AAAAAAAAAA AAAAAAA 2427 
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<210>~~6 1 

<211> 671 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 6 

MPLIHRKKPT EKPSTPPSEE WHDEDSQKK 
IDSCCWFIGC VCVTWWFLLF LYNAMPASFP 
KEGLKAKHPV VFIPGIVTGG LELWEGKQCA 
LCWVEHMSLD NETGLDPAGI RVRAVSGLVA 
IGYEEKNMYM AAYDWRLSFQ NTEVRDQTLS 
VPHSMGVLYF LHFMKWVEAP APLGGGGGFD 
KAVAGLFSAE AKDVAVARAI APGFLDTDIF 
MLPKGGDTIW GGLDWSPEKG HTCCGKKQKN 
GRMISFGKEV AEAAPSEINN IDFRGAVKGQ 
GIKAIAEYKV YTAGEAIDLL HYVAPKMMAR 
PKYWSNPLET KLPNAPEMEI YSLYGVGIPT 
FTSAHEEDED SCLKAGVYNV DGDETVPVLS 
KTYIREYNHS PPANLLEGRG TQSGAHVDIM 
IGHDQVHSGI FEWSERIDLK L 



PHESSKSHHK 


KSNGGGKWSC 


50 


QYVTERITGP 


LPDPPGVKLK 


100 


DGLFRKRLWG 


GTFGEVYKRP 


150 


ADYFAPGYFV 


WAVLIANLAH 


200 


RMKSNIELMV 


STNGGKKAVI 


250 


WCAKYIKAVM 


NIGGPFLGVP 


300 


RLQTLQHVMR 


MTRTWDSTMS 


350 


NETCGEAGEN 


GVSKKSPVNY 


400 


SIPNHTCRDV 


WTEYHDMGIA 


450 


GAAHFSYGIA 


DDLDDTKYQD 


500 


ERAYVYKLNQ 


SPDSCIPFQI 


550 


AGYMCAKAWR 


GKTRFNPSGI 


600 


GNFALIEDIM 


RVAAGGNGSD 


650 



671 
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<210> 7 
<211> 643 
<212> cDNA 
<213> Zea mays 

<221> CDS 
<222> (1) . . (402) 

<400> 6 

CGG GAG AAA ATA GCT GCT TTG AAG GGG GGT GTT TAC TTA GCC GAT GGT 48 

Arg Glu Lys He Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly 
15 10 15 

GAT GAA ACT GTT CCA GTT CTT AGT GCG GGC 'fAC ATG TGT GCG AAA GGA 96 
Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly 
20 25 30 

TGG CGT GGC AAA ACT CGT TTC AGC CCT GCC GGC AGC AAG ACT TAC GTG 144 
Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val 
35 40 45 

AGA GAA TAC AGC CAT TCG CCA CCC TCT ACT CTC CTG GAA GGC AGG GGC 192 
Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly 
50 55 60 
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ACC CAG AGC GGT GCA CAT GTT GAT ATA ATG GGG AAC TTT GCT CTA ATT 240 
Thr Gin Ser Gly Ala His Val Asp He Met Gly Asn Phe Ala Leu He 
65 70 75 80 

GAG GAC GTC ATC AGA ATA GCT GCT GGG GCA ACC GGT GAG GAA ATT GGT 288 
Glu Asp Val He Arg He Ala Ala Gly Ala Thr Gly Glu Glu He Gly 

8 5 90 95 

GGC GAT CAG GTT TAT TCA GAT ATA TTC AAG TGG TCA GAG AAA ATC AAA 336 
Gly Asp Gin Val Tyr Ser Asp He Phe Lys Trp Ser Glu Lys He Lys 
100 105 HO 

TTG AAA TTG TAA CCTATGGGAA GTTAAAGAAG TGCCGACCCG TTTATTGCGTTCC 3 91 
Leu Lys Leu 
115 

AAAGTGTCCT GCCTGAGTGC AACTCTGGAT TTTGCTTAAA TATTGTAATT TTTCACGC 44 9 
TTCATTCGTC CCTTTGTCAA ATTTACATTT GACAGGACGC CAATGCGATA CGATGTTG 507 
TACCGCTATT TTCAGCATTG TATATTAAAC TGTACAGGTG TAAGTTGCAT TTGCCAGC 5 65 
TGAAATTGTG TAGTCGTTTT CTTTACGATT TAATANCAAG TGGCGGAGCA GTGCCCCA 623 
AGCNAAAAAA AAAAAAAAAA 643 
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<2 1Q > 8 

<211> 115 
<212> PRT 
<213> Zea mays 

<400> 8 

Arg Glu Lys lie Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly 
15 10 15 

Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly 
20 25 30 

Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val 
35 40 45 

Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly 
50 55 60 

Thr Gin Ser Gly Ala His Val Asp lie Met Gly Asn Phe Ala Leu lie 
65 " 70 75 80 

Glu Asp Val He Arg He Ala Ala Gly Ala Thr Gly Glu Glu He Gly 

85 90 95 

Gly Asp Gin Val Tyr Ser Asp He Phe Lys Trp Ser Glu Lys He Lys 
100 105 HO 



Leu Lys Leu 
115 
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<2X0>-9 

<211> 616 
<212> CDNA 

<213> Neurospora crassa 



<400> 9 

ggcggcgaag acganggcgg 
agatggatct accctctaga 
gtntacngtt tntacgggta 
tggcgcccga tcccgggacg 
actttgactn aggggcacat 
tggcacagtg aaccttatga 
aaatgaagag atacaaccct 
ccgcatgaac cagaacggtt 
tcacgtggat attctaggaa 
tggcggcagg tcgaggcgat 
cttaaatatg tagaaaaggt 
acataggtta cccaatagta 
aaaaaaaaaa aaaaaa 



aagttggagg ctaacgagaa 
gacacgacta ccnttgcacc 
ggaagccgac ggagcgagcc 
acaacgcatc tttagatgac 
tgaccacggt gtgattttgg 
gtttggggta cctgcgcaat 
gcgggctcaa aaataaccgt 
caatccgaga ggagggccga 
ggcagaatct aaacgagtac 
acaattgagg attttattac 
tgaaatttat gaagagtaat 
tgactaatta aaaaaaaatt 

616 



tgacnctcgg 


50 


cagcctcaag 


100 


tacatctacc 


150 


gatcgatacg 


200 


gcgaaggcga 


250 


aaggggtgga 


300 


ggtcgagatg 


350 


atacggcgga 


400 


actcttaaag 


450 


tagcaatatt 


500 


taaatacggc 


550 


ttttttccaa 


600 



SUBSTITUTE SHEET (RULE 26) 



WO 00/60095 



15 /53 



PCT/EP00/02701 



<21D>^XD 

<211> 1562 

<212> genomic DNA 

<213> Arabidopsis thaliana 



<400> 10 

ATGAAAAAAA TATCTTCACA TTATTCGGTA GTCATAGCGA TACTCGTTGT 50 

GGTGACGATG ACCTCGATGT GTCAAGCTGT GGGTAGCAAC GTGTACCCTT 100 

TGATTCTGGT TCCAGGAAAC GGAGGTAACC AGCTAGAGGT ACGGCTGGAC 150 

AGAGAATACA AGCCAAGTAG TGTCTGGTGT AGCAGCTGGT TATATCCGAT 200 

TCATAAGAAG AGTGGTGGAT GGTTTAGGCT ATGGTTCGAT GCAGCAGTGT 250 

TATTGTCTCC CTTCACCAGG TGCTTCAGCG -ATCGAATGAT GTTGTACTAT 300 

GACCCTGATT TGGATGATTA CCAAAATGCT CCTGGTGTCC AAACCCGGGT 350 

TCCTCATTTC GGTTCGACCA AATCACTTCT ATACCTCGAC CCTCGTCTCC 400 

GGTTAGTACT TTCCAAGATA TATCATTTTG GGACATTTGC ATAATGAACA 450 

AAATAGACAT AAATTTGGGG GATTATTGTT ATATCAATAT CCATTTATAT 500 

GCTAGTCGGT AATGTGAGTG TTATGTTAGT ATAGTTAATG TGAGTGTTAT 550 

GTGATTTTCC ATTTTAAATG AAGCTAGAAA GTTGTCGTTT AATAATGTTG 600 

CTATGTCATG AGAATTATAA GGACACTATG TAAATGTAGC TTAATAATAA 650 

GGTTTGATTT GCAGAGATGC CACATCTTAC ATGGAACATT TGGTGAAAGC 700 

TCTAGAGAAA AAATGCGGGT ATGTTAACGA CCAAACCATC CTAGGAGCTC 750 

CATATGATTT CAGGTACGGC CTGGCTGCTT CGGGCCACCC GTCCCGTGTA 800 

GCCTCACAGT TCCTACAAGA CCTCAAACAA TTGGTGGAAA AAACTAGCAG 850 

CGAGAACGAA GGAAAGCCAG TGATACTCCT CTCCCATAGC CTAGGAGGAC 900 

TTTTCGTCCT CCATTTCCTC AACCGTACCA CCCCTTCATG GCGCCGCAAG 950 

TACATCAAAC ACTTTGTTGC ACTCGCTGCG CCATGGGGTG GGACGATCTC 1000 

TCAGATGAAG ACATTTGCTT CTGGCAACAC ACTCGGTGTC CCTTTAGTTA 1050 

ACCCTTTGCT GGTCAGACGG CATCAGAGGA CCTCCGAGAG TAACCAATGG 1100 

CTACTTCCAT CTACCAAAGT GTTTCACGAC AGAACTAAAC CGCTTGTCGT 1150 

AACTCCCCAG GTTAACTACA CAGCTTACGA GATGGATCGG TTTTTTGCAG 1200 

ACATTGGATT CTCACAAGGA GTTGTGCCTT ACAAGACAAG AGTGTTGCCT 1250 

TTAACAGAGG AGCTGATGAC TCCGGGAGTG CCAGTCACTT GCATATATGG 1300 

GAGAGGAGTT GATACACCGG AGGTTTTGAT GTATGGAAAA GGAGGATTCG 1350 

ATAAGCAACC AGAGATTAAG TATGGAGATG GAGATGGGAC GGTTAATTTG 1400 

GCGAGCTTAG CAGCTTTGAA AGTCGATAGC TTGAAC AC CG TAGAGATTGA 1450 

TGGAGTTTCG CATACATCTA TACTTAAAGA CGAGATCGCA CTTAAAGAGA 1500 

TTATGAAGCA GATTTCAATT ATTAATTATG AATTAGCCAA TGTTAATGCC 1550 
GTCAATGAAT GA 1562 
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<21<fc>-~i-i 

<211> 3896 

<212> genomic DNA 

<213> Arabidopsis thaliana 



<400> 11 

ATGGGAGCGA ATTCGAAATC AGTAACGGCT TCCTTCACCG TCATCGCCGT 50 

TTTTTTCTTG ATTTGCGGTG GCCGAACTGC GGTGGAGGAT GAGAC CGAGT 100 

TTCACGGCGA CTACTCGAAG CTATCGGGTA TAATCATTCC GGGATTTGCG 150 

TCGACGCAGC TACGAGCGTG GTCGATCCTT GACTGTCCAT ACACTCCGTT 200 

GGACTTCAAT CCGCTCGACC TCGTATGGCT AGACACCACT AAGGTCCGTG 250 

ATCTTCATTT CCTTCGCTCC TTATTCTGTC GGTCGAGTCA CTTGTTGATG 300 

AATTCCAAGC GAAATATAGC AATGAAGCAT GTCTCGTCTC TCTTATTGAT 350 

TCGTTCATTA GTCAACAGTG ACGCTTCTGA ATCTGAGTTT AGAGTCATAT 400 

AAAACAGCTG ACTCGGCGAG TGTTTCCCAT CGCTTTTGGT TCGCTAAATG 450 

TAGCGCAATG AATGTGTAAT TAGTCTGCGC TTTTTATTCA ACTAGATCTG 500 

CAAGTTTTTC AGAGTGCTCA ATAGTAGTTA GAAAATGTTA GGTCATTTTA 550 

CTTGTGCATT GTGATTCTTT TGGTTGTTGC TTACTGATCG ACGTGATGGA 600 

TGGTTTACAG CTTCTTTCTG CTGTCAACTG CTGGTTTAAG TGTATGGTGC 650 

TAGATCCTTA TAATCAAACA GACCATCCCG AGTGTAAGTC ACGGCCTGAC 700 

AGTGGTCTTT CAGCCATCAC AGAATTGGAT CCAGGTTACA TAACAGGTAG 750 

TTTCGGATTT TTCTTTCTTT TGAGTTTTCT TCAATTTGAT ATCATCTTGT 800 

TGTGATATAA TATGGCTAAG TTCATTAATT TGGTCAATTT TCAGGTCCTC 850 

TTTCTACTGT CTGGAAAGAG TGGCTTAAGT GGTGTGTTGA GTTTGGTATA 900 

GAAGCAAATG CAATTGTCGC TGTTCCATAC GATTGGAGAT TGTCACCAAC 950 

CAAATTGGAA GAGCGTG AC C TTTACTTTCA CAAGCTCAAG TTAGTCCTTA 1000 

TCAGGCTAAT GTCTTTTATC TTCTCTTTTT ATGTAAGATA AGCTAAGAGC 1050 

TCTGGTCGTC TTCCTTTTTG CAGGTTGACC TTTGAAACTG CTTTAAAACT 1100 

CCGTGGCGGC CCTTCTATAG TATTTGCCCA TTCAATGGGT AATAATGTCT 1150 

TCAGATACTT TCTGGAATGG CTGAGGCTAG AAATTGCACC AAAACATTAT 1200 

TTGAAGTGGC TTGATCAGCA TATCCATGCT TATTTCGCTG TTGGTACCGG 1250 

CCTACTATCC TTAAGTTACC ATTTTATTTT TTCTCTAATT GGGGGAGTTA 1300 

TGTTGTG AC T TACTGGATTG AGCTCGATAC CTGATTTGTT GTTGATTTAG 1350 

GAGCTCCTCT TCTTGGTTCT GTTGAGGCAA TCAAATCTAC TCTCTCTGGT 1400 

GTAACGTTTG GCCTTCCTGT TTCTGAGGTG ACCTCTGACT TCTCTTTAGT 1450 

TTTAAGTAGT TGATATCAAC CAGGTCTTAT AACTCACTGG ATTTTCCTTT 1500 

TGAAAGTATT ACTTTTGTTA ATTGAACTGC TGTACGCGAT ATGGTATCTG 1550 

TAGATCTTGA AGTGCTAGTT ATCAAAGAAC ATATTGTGGG TAGTATACCT 1600 

GTCAGCGGCC TTAGCTAATA CAACCAAACC ACATGTACAC TGATTTAGTT 1650 

TTCAGATTAT TATGGTAGAC TTTAAGTTGA GAAGAAACTT TGACTGAAAT 1700 

CTTTTTATTT TAATAGGCTA TGATTTGTTT ATTGAAATCA TGTGACATAT 1750 

TGACATGCGC TTCTCATGTT TTTTGTTGGC AAGGCTTCAG GGAACTGCTC 1800 

GGTTGTTGTC CAATTCTTTT GCGTCGTCAT TGTGGCTTAT GCCATTTTCA 1850 

AAGAATTGCA AGGGTGATAA CACATTCTGG ACGCATTTTT CTGGGGGTGC 1900 

TGCAAAGAAA GATAAGCGCG TATACCACTG TGATGAAGAG GAATATCAAT 1950 

CAAAATATTC TGGCTGGCCG ACAAATATTA TTAACATTGA AATTCCTTCC 2000 

ACTAGCGGTT AGACTCTGTA TATGCAACTG TAACACTAAC AAAAGTTTCA 2050 

CCAAGAATGT TCACTCTCAT ATTTCGTTCC TTTGATGTGT ATCCATCAGT 2100 

TACAGAAACA GCTCTAGTCA ACATGACCAG CATGGAATGT GGCCTTCCCA 2150 

CCCTTTTGTC TTTCACAGCC CGTGAACTAG CAGATGGGAC TCTTTTCAAA 2200 

GCAATAGAAG ACTATGACCC AGATAGCAAG AGGATGTTAC ACCAGTTAAA 2250 

GAAGTACGTA CCTTTCTTTG TGATAAGAAA TATTGCTCAT CGATCATCAC 2300 
TTGCTGGCTT CTTGTACGTC AAATTGTTTT GTTTAAATCT CTATATCAAT 2350 
TGTTCATATG CTTTGTCTTT CTTACTATAA GAAACAAGTA. TAATCAGAAA 2400 
CCTTATTATT GATTATCAGT TCTCTCCTTA TATTATGGAA TGTCTTTTTC 2450 
GTTTACAGTT ATGAATGCAA AAGGGGGTAT TTTAGTTGAT TGATTCTCTC 2500 
ATTCTCTAGT TTGTTTTGAC TAATAGCGTC AATTTTGTTT TTCTAGCAAA 2550 
TCTTTGTGAA TTATATATAA CATGCTAACT AT AC TTTTC A GGTTGTATCA 2600 
TGATGACCCT GTTTTTAATC CTCTGACTCC TTGGGAGAGA CCACCTATAA 2 650 

AAAATGTATT TTGCATATAT GGTGCTCATC TAAAGACAGA GGTATGATGC 2700 
ATTCTCAATA TCACATTATG CGTTGACTTT GTTATTATAT TCCCCATTTG 2750 
GTTTGCAATA TCTTTTTGAA TTATGATTTA TCTTCTCCCT TGCATCTTAT 2800 
GCTATTAAGC GTTAAAGGTA C T AAATGT AT GAAGCTGTCT GTCATAGGTT 2 850 
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GGTTATTACT TTGCCCCAAG TGGCAAACCT TATCCTGXTA AT.TGGATCKT 2*900 

CACGGATATC ATTTATGAAA CTGAAGGTTC CCTCGTGTCA AGGTAATTTT 2950 

CCGCAATGGC AGAAGTAAAA CAGGAAGGCA AAGTCTTCTG TATCAGTCTA 3000 

GTGGCATGTT ATCTCAGTTG CATAAGCAAA TTATTAAACA ACT AAAAT TT 3050 

AAGTACTTTT TTATCATTCC TTTTGAGCTT AGTGGATGAT CAGTGGCTTA 3100 

AAGTGGGAAG AGGTGTTGCA TGAAACATGA . CACTTGTATC AAAGATAACT 3150 

AGCAAAACAA AACTAACCCA TTTCTGAATT TCATATTATT AGGAGTAGTC 3200 

GTGCTTTTAA AAAATTTGTT TT AAGAAAC C GAAAAACTAG TTCATATCTT 3250 
GATTGTGCAA TATCTGCAGG TCTGGAACTG TGGTTGATGG GAACGCTGGA 3300 

CCTATAACTG GGGATGAGAC GGTAAGCTCA GAAGTTGGTT TTGAAATTAT 3350 

CTTCTTGCAA ACTACTGAAG ACTAAGATAA TACTTGCTTC TGGAACACTG 3400 

CTTGCTATGT TCTCTAGTAC ACTGCAATAT TGACTCTCCG CTACTTTTAT 3450 

TGATTATGAA ATTGATCTCT TATAGGTACC CTATCATTCA CTCTCTTGGT 3500 

GCAAGAATTG GCTCGGACCT AAAGTTAACA TAACAATGGC TCCCCAGGTA 3550 

CTCTTTTTTA GTTCCTCACC TTATATAGAT CAAACTTTAA GTGTACTTTT 3 600 

CTGGTTATGT GTTGATTTAC CTCCAATTTG TTCTTTCTAA AAATCATATA 3 650 

TCTCTGTACT CCTCAAGAAC TTGTATTAAT CTAAACGAGA TTCTCATTGG 3700 

GAAAATAAAA CAACAGCCAG AACACGATGG AAGCGACGTA CATGTGGAAC 3750 

TAAATGTTGA TCATGAGCAT GGGTCAGACA TCATAGCTAA CATGACAAAA 3800 

GCACCAAGGG TTAAGTACAT AACCTTTTAT GAAGACTCTG AGAGCATTCC 3850 
GGGGAAGAGA ACCGCAGTCT GGGAGCTTGA TAAAAGTGGG TATTAA 3896 
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<210>~ 12— - 

<211> 709 
<212> cDNA 

<213> Lycopersicon esculentum 
<400> 12 

CTGGGGCCAA AAGTGAACAT AACAAGGACA CCACAGTCAG AGCATGATGT 50 

TCAGATGTAC AAGTGCATCT AAATATAGAG CATCAACATG GTGAAGATAT 100 

CATTCCCAAT ATGACAAAGT TACCTACAAT GAAGTACATA ACCTATTATG 150 

AGGATTCTGA AAGTTTTCCA GGGACAAGAA CAGCAGTTTG GGAGCTTGAT 200 

AAAGCAAATC ACAGGAACAT TGTCAGATCT C C AGCTTTG A TGCGGGAGCT 250 

GTGGCTTGAG ATGTGGCATG ATATTCATCC TGATAAAAAG TCCAAGTTTG 3 00 

TTACAAAAGG TGGTGTCTGA TCCTCACTAT TTTCTTCTAT AAATGTTTGA 350 

GTTTGTATTG ACATTGTAAG TATTGCAACA AAAAGCAAAG CGTGGGCCTC 400 

TGAGGGATGA GGACTGCTAT TGGGATTACG GGAAAGCTCG ATGTGCATGG 450 

GCTGAACATT GTGAATACAG GTTAGAATAT TCAAATTATA TTTTGCAAAA 500 

TATTCTCTTT TTGTGTATTT AGGCCACCTT TCCCCGGTCA CAACGATGCA 550 

GATATGTATT CGGGGATGTT CACCTGGGAC AGAGTTGCAG ATTGAAGAGT 600 

TCTACATCTC ACATCCTGTC ACACTATGTG TGATATTTAA GAAACTTTGT 650 

TTGGCGGAAC AACAAGTTTG CACAAACATT TGAAGAAGAA AGCGAAATGA 7 00 

TTCAGAGAG .709 
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<210> 13 
<211> 623 
<212> PRT 

<213> Schizosaccharomyces pombe 
<400> 13 

MASSKKSKTHKKKKEVKSPIDLPNSKKPTRA 

FFFAVGDDNAVFDPATLDKFGNMLGSSDLFDDIKGYLSYNVFKD^ 

GYRSDHPVIMVFGVT SSGLESWSFNNCS I PYFRKRLWGSWSMLKAMFLDKQCWLEHLMLDKKTGLDPKGI2 1 0 
KLRAAQGFEAADFFITGYWIWSKVIENIiAAIGYEPNNMLSA 8 0 

IVHKKKVVLISHSMGSQVTYYFFKWVEAEGYGNGGPT^ 5 0 

TGIVITLNILEKFFSRSERAMMVRTMGGVSSMLPKGG 

DDALQFLKNVTDDDFKVMLAKNYSH^^ 9 0 

ERGYYYTNNPEGQPVIDSSVNDGTKVENGIVMD^ 

HEPAAFDLRGGPRS AEHVDI LGHSELNE I ILKVS SGHGDSVPNRY I SDIQEI INEINLDKPRN 623 
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<210> 14*" ~~ " ~ . 
<211> 432 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 14 

MKKI S SHYSWI AI LVWTMTSMCQ AVGSNVY PL I LVPGNGGNQLEVRLDREYKPS SVWC S SWL YP IHKK7 0 
SGGWFRLWFDAAVLLS PFTRCFSDRMMLYYDPDLDDYQNAPGVQTRVPHFGSTKSLLYLDPRLRDATSYM1 4 0 
EHLVKALEKKCGYVNDQTILGAPYDFRYGLAASGHPSRVASQFLQDLKQLVEKTSSENEGKPVILLSHSL2 1 0 
GGLFVLHFLNRTTPSWRRKYIKHFVALAAPWGGTI SQMKTFASGNTLGVPLVNPLLVRRHQRTSESNQWL2 8 0 
LPSTKWHDRTKPLVVTPQVNYTAYEMDRFFADIGFSQGWPYKTRVLPLTEELM^ 5 0 

TPEVLMYGKGGFDKQPEIKYGDGDGTWLASLAALKVDSLOT 
NYELANVNAVNE 432 
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< 210> 1 5 
<211> 552 
<212> PRT 

<213> Arabidopsis thaliana 



PCT/EP00/02701 



<400> 15 

MGANSKSVTASFWIAWFLICGGRTAVEDETEFHGDYSKLSGIIIPGFASTQLRAWSILDCPYTPLDFN70 
PLDLVWLDTTKLLSAVNCWKCMVLDPYNQTDHPECKSRPDSGLSAITELDPGYITGPLST^ 
VEFGIEANAIVAVPYDWRLSPTKLEERDLYFHKLKLT^ 
EIAPKHYLKWLDQHIKAYFAVGAPLLGSVEAIKSTLSGW 

KGDNTFWTKFSGGAAKKDKRVYHCDEEEYQSKYSGWPTNIINIEIPSTSARELADGTLFKAIEDYD 
RMLHQLKKTY^PFFVIRNIAHR 

KPYPDNWIITDIIYETEGSLVSRSGTVVDGNAGPITGDEWPYH^ 

KQQ P EHDG S DVHVELNVDHEHG S D 1 1 ANMTKAPRVKYITFYEDSES I PGKRTAVWELDKSGY 552 
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5 



10 



15 

<I70> patenfcln Ver. 2.0 

<210> 1Q- 
<211> 661 
20 <212> ?RT 

<213> Saccharomyces cerevisiae 

<400> i 



Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 
25 1 5 10 

Asp fitt Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 
20 25 
30 Asn His tie His His Gin Gin Gly Leu Gly His Lys Arg Arg Arg Gly 



35 



40 



lie Ser Gly Ser Ala Lys Arg Asn Giu Arg Gly Lys Asp Phe Asp Arc 



Lys Arg Trp 1 
65 " * " 70 75 



35 Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 





lie 


Phe 


He 


Leu 


Gly Ala Phe 


Leu 


Gly 


Val 


Leu 


Leu 


Pro 


Phe 


Ser 
95 


Phe 


40 










85 








90 














Gly Ala 


Tvr 


His 
100 


Val 


His Asn 


Ser 


Asp 
105 


Ser 


Asp 


Leu 


Phe 


Asp 
110 


Asn 


Phe 


45 


Val 


Asn 


Phe 
115 


Asp 


Ser 


Leu Lys 


Val 
120 


Tyr 


Leu 


Asp 


Asp 


n^r^o 
125 


Lys 


Asp 


Val 




Leu 


Pro 
13 0 


Gin 


Gly 


lie 


Ser Ser 
135- 


Phe 


lie 


Asp 


ASp 


lie 
140 


Gin 


Ala 


Gly 


Asn 


50 


Tyr 

- .i - 

— -S Zi 


Ser 


?hr 


Ser 


Ser 


Leu Asp 
150 


Asp 


Leu 


Ser 


Glu 
155 


Asn 


Phe 


Ala 


Val 


Glv 
ISO 


55 


Lys 


Gin 


Leu 


Leu 


Arg 
155 


Asp Tyr 


AST. 


lie 


Glu 
170 


Ala 


Lys 


His 


Pro 


Val 
175 


Val 




Mez 


Val 


Pre 


Gly 
180 


Val 


lie Ser 


mu — 


Gly 
135 


lie 


Glu 


Ser 


Tr*o 


Gly 
190 


Val 


Zie 


60 


Gly 


•> 


Asp 

1 


Glu 


Cys 


Asp Ser 


Ser 
2 00 


Ala 


His 


?he 


Arc 


Lvs 
205 


Arg 


Leu 
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Gl v Ser Pfce Tyr Met Leu fJ Me, Val Met Asp Lys Val Cys Trp 

5 Leu Lys His val «« Leu »P - «» Tto S ^ "° 

225 

Pae Tor Leu M U. U. «» P*e Glu ser Tar »P Tyr P|« tie 
10 Ha Gly Tyr Trp He Trp - Lys val PHe Gl. »- "V val ". 

260 Z&3 
Sly Tyr Glu ,re *sa Lys »« gj — fa! ** 

t,t « «, - «» «| « « is? ThI ws Leu Lys 

290 29 
20 Glu Gin He Glu Leu Phe His Gin Leu ser Gly Clu Lye val Cys Leu 
305 310 

Xle Gly Ki. Ser « Gly Ser Gin XI. g. - Tyr Ph. «er Lys Trp 
25 vel Gin M. Glu X Pro « Tyr Gly asn Gly Gly X. ? ly TrP Val 

Thr Leu Leu Gly 
Asn Glu Kis Xie Asp Ser Phe lie Asn Ala Axa Gly Bj- 

j0 ^ »i. T«i Tie Se^ Gly Glu Met Lys Asp Thr 

Ala Pro Lys Ala Val Pro Ala Leu xle Se. fcxy ^ 

370 3/3 _ 

35 lie Gin Leu «n Thr Leu *la Me, Tyr Gly Leu Glu Lys Ph. Ph. Ser 

"1 U. Glu « val Lys Her Leu Gin Thr Trp Gly Gly He Pre Ser 
405 

40 Her Leu Pro Lys Gly Glu Glu val XI. Trp Gly » «er Lys Ser 3er 
sar Glu Ta Leu asa *sn hsn Thr asp Thr Tyr Gly — Phe He 

455 

470 



45 435 

^ a Phe Asn Lys Asr- Leu Thr Met 
Arg Phe Glu Arg Asn Thr Ser Asp A_a 4gQ 
450 433 

Tlo t>ro Glu Trp Leu Gin 

50 Lys_ Asp Ala He Asr, Met Thr Leu Ser H- -e. 48Q 

. ... r ,.. ,vr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 

Arg Arg Val His Glu 490 493 

435 

5* .... — « _„ V 4= t-o Ser Asn Pro Met: 

Glu Leu Arg Lys Asn ^--^ — _ n - =10 

500 3U3 

Glu val Pro Leu Pr= Glu ,1a Pro Kis « Lys He Tyr =ys He Tyr 

60 515 320 
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Gly Val Asn Asn Pro Thr C-lu Arg Ala Tyr Val Tyr Lys Glu Glu ASP 

530 535 
Aso ser Ser Ala Leu Asn Leu Thr He Asp Tyr Glu Ser Lys Gin Pro 
5 545 530 533 

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser. 
5SS 370 
10 Met cys His Lys Trp Ala Gin Gly Ala Ser Pro Tyr Asn Pro Ala Gly 
580 585 
He Asn Val Thr He Val Glu Met Lys His Gin Pro Asp Arg Phe Asp 



595 



600 



15 He mi Gly Sly S«r U. «. His Val up H« L«u Gly Ser 

610 615 
- Ala Glu Leu Asn Asp Tyr lie Leu Lys He Ala Ser Gly Asn Gly Asp 
20 525 «0 6 - 3 

Leu Val Glu Pro Arg Gin Leu Ser Asn Leu Ser Gin Trp Val Ser Gin 
645 650 

25 Met: Pro Phe Pre Met 
660 

<210> 2CU 
30 <211> 387 
<212> PRT 

<213> Arabidopsis thai i ana 

<400> 2 t TlflTfl „ v _i r>- 0 G 1 v Asn Gly Gly 

35 Val Giv Ser Asn Val Tyr Pro Leu Xle Leu Val -o c._ 

1 5 

_ TiV <- orn Ser Ser Val 
Asn Gin Leu Glu Val Arg Leu Asp Arg »*u Tyr i,y o 

20 23 
Trp Cys Ser Ser Trp Leu Tyr Pro He His Lys Lys Ser Gly Gly Trp 



40 



35 



Phe Arg Leu Tro Phe Asp Ala Ala Val Leu Leu Ser Pro Phe Thr Arg 
45 50 55 

^ Asr> Pro Asp Leu Asp Asp 

Cvs Phe Ser Asp Arg Met Met ^eu Tyr ^ 80 
65 70 7 - 



r i- «-H- £-cr Val Pro Kis Phe Gly Ser 
50 Tyr Gin Asn Ala Pro Giy Val GU. v ^ 95 

35 - J 

^hr Lvs Ser Leu Leu Tyr ueu Aow > — * - * 0 

100 105 



Leu Tyr Leu As? Pro Arg Leu ^ ^ r 

00 

... _ , 7a , tvs ^ = t.- u Glu Lvs lys Cys Gly Tyr Val 

Tyr Met Glu His Leu Va^. ajl- *~ - . ?3 

115 120 

Asn Asp Gin Thr lie Leu Gly Ala Pro Tyr Asp ^ Arg Tyr Gly Leu 
60 130 133 
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Ala Ala Ser Gly His Pro Ser Arg Val Ala Ser Gin Phe Leu Gin Asp 
145 150 15 = 

Leu Lys Gin Leu Vai Glu Lys Thr Ser Ser Glu Asa Glu Gly Lys Pro 
5 165 170 

Val lie Leu Leu Ser His Ser Leu Gly Gly Leu Phe Val Leu His Phe 
180 185 i3U 

10 Leu Asn Arg Thr Thr Pro Ser Trp Arg Arg Lys Tyr lie Lys His Phe 
195 200 2U:> 

Val Ala Leu Ala Ala Pro Trp Gly Gly Thr He Ser Gin Met Lys Thr 
210 215 220 

Phe Ala Ser Gly Asn Thr Leu Gly Val Pro Leu Vai Asn Pro Leu Leu 
225 230 235 

Val Arg Arg His Gin Arg Thr Ser Glu Ser Asn Gin Trp Leu Leu Pro 
20 " 245 250 

Ser Thr Lvs Vai Phe His Asp Arg Thr Lys Pro Leu Val Vai Thr Pro 
" 260 265 

25 Gin Val Asn Tyr Thr Ala Tyr Glu Met Asp Arg Phe Phe Ala Asp He . 
275 280 285 

C-lv Phe Ser Gin Gly Vai Val Pro Tyr Lys Thr Arg Val Leu Pro Leu 
" 290 295 300 

30 "hr Glu Glu Leu Met Thr Pro Gly Val Pro Val Thr Cys He Tyr Gly 
305 310 315 

Arc Giv vai Asp Thr Pro Glu Val Leu Met Tyr Gly Lys Gly Gly Phe 
35 " " 325 330 335 

Ast. Lys Gin Pro Glu He Lys Tyr Gly As? Gly Asp Gly Thr Val Asn 
- y 340 345 350 

40 Leu Ala Ser Leu Ala Ala Leu Lys Val Asp Ser Leu Asn Thr Vai Glu 
355 330 35= 

lis Aso Giv Val Ser His Thr Ser He Leu Lys Asp Glu He Ala Leu 
370 " 375 380 



45 



Lys Glu lie 
385 



50 <210> 3 

<2il> 389 
<212> PHT 

<213> Arabicopsis thaiiar.a 
" l^lyl Lys Glu Gly Leu Lys Ala Lys His Pro Val Val Phe lie Pro 

-.-v ti a v =-. -hr G-v Giv Leu Glu Leu Trp Glu Gly Lys Gin Cys Ala 
60 """ ~~ "*20 " 25 30 
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Asp Giy Leu Phe Arg Lys Arg Leu Trp Gly Gly Thr Phe Leu Cys Trp 

35 40 
Val Glu His Met Ser Leu Asp Asn Glu Thr Gly Leu Asp Pro Ala Gly 
5 SO 55 

lie Arg Val Arg Ala Val Ser Gly Leu Val Ala Ala Asp Tyr Pile Ala 
65 70 75 

10 Pro Gly Tyr Phe Val Trp Ala Val Leu lie Ala Asn Leu Ala His He 

35 

Gly Tyr Glu Glu Lys Asn Met Tyr Met: Ala Ala Tyr Asp Trp Arg Leu 
100 105 ~ 



Ser Phe Gin Asn Thr Glu Arg Asp Gin Thr Leu Ser Arg Met Lys Ser 
.115 I 20 

Asn lie Glu Leu Met Val Ser Thr Asn Gly Gly Lys Lys Ala Val He 
20 130 135 140 

Val Pro His Ser Met Gly Val Leu Tyr Phe Leu His Phe Met Lys Trp 

145 150 i5 = 

25 Val Glu Ala Pro Ala Pro Leu Gly Gly Gly C-ly Gly Pro Asp Trp Cys 

Ala Lys Tyr He Lys Ala Val Met Asn He Giy Gly Pro Phe Leu Gly 
180 185 

30 Val Pro Lys Ala Val Ala Gly Leu Phe Ser Ala Glu Ala Lys Asp Met 
iss 200 205 

Arg Met Thr Arg Thr Trp Asp Ser Thr Met Ser Met Leu Pro Lys Gly 
35 210 215 220 

Glv Asn Thr He Trp Gly Gly Leu Asp Trp Ser Pro Glu Leu Pro Asr. 
225 " 230 233 

40 Ala Pro Glu Met Glu He Tyr Ser Leu Tyr Gly Val Gly He Pro Thr 

250 



45 



245 

Glu Arc Ala Tvr Val Tyr Lys Leu Asn Gin Ser Pro Asp Ser Cys He 
260 265 



Pro Phe Gin He Phe Thr Ser Ala His Glu Glu Asp Glu Asp Ser Cys 

275 280 2S = 

Le» Lvs Ai- ftiv Vai Tvr Asn Val Asp Giy As? Glu Thr Val Pro Val 

50 " 290 * * " 29= 300 

Leu Ser Ala C-ly Tyr Met Cys Ala Lys Ala Trp Arg Giy Lys Thr Arg 
305 310 

55 Phe Asn Pre Ser Gly He Lys Thr Tyr He Arg Glu Tyr Asn His Ser 

225 * 

Pro Pre Ala Asn Leu Leu Glu Giy Arg Giy Thr Gin Ser Gly Ala His 
340 345 «° 

60 Val Asp He Met Giy Asn Phe Ala Leu Ha Glu Asp He Met Arg Val 
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355 

Ala »la Sly Gly «n Gly Ser «, lie Gly ais M «» "al His S.r 
370 375 

Gly He P'ne Glu Trp 
385 



10 <210> 4CC 
<211> 1986 

<212> DNA . 
<213> Saccaaroinyces cerevisiae 

15 <220> 

<221> CDS 

<222> (1) • • (1983) 



20 



25 



<40Q> 4 t cag aac caa aag agt gat tct 48 

ate ggc aca ccg tec cga aga aat g.e cag ^ ^ Ser Ser 
Met Gly Thr Leu Pbe Arg- Arg Asr. va_ ^ 15 
1 5 



35 



40 



45 



35 



60 



gac gaa aac aat aaa ggg gj J* Jg -c aag cga gag age aga 
Aso Glu Asa Asr. Lys Gly Gly Ser Val Hxa asn 3Q 
20 

aac cac ace cac cat caa cag gga eta ggc cat aag aga aga agg ggt. 
Asn His He His His Gin Gin Gly Lea c_y ta y 4g 
35 40 

S S 35 S K S SS S5 S S5 J£ K SS K S 

SO 55 

aaa a S a B a= S?9 aaa gga «» J- «c ^ Jg « g JR 2S 
Lvs sxo Ast5 Gly Asn Gly Arg ±y s - 80 

"S5 " ' 70 _ 

act ttc att ctt ggt gca tec tta ggc gca ctt ccg ccg J« a*, fc^ 

He Phe He Leu Gly Ala Phe Leu Gly Val Leu u g _ 
85 yu 

t-^-rr fc*"£ CT2LC fiLfiLC tfct 

ggc gec cat cat gtc cat aat age gac age gac ccg ^ ^ ^ ^ 
Glv Ala Tyr His Vax His Asn Se-.As? be- *s*> 11Q 
100 103 

gaa aaa aaa gaa aca aaa aaa gag caa aag gaa gac g « «g •£ 
Val Asr. Phe Asp Ser Leu Lys vai ^ . * 12 5 

- cr=»- act cag get: ggt aac 

ccc cca caa ggc aca age teg t.. |a- Qln ^ Gly Asn 

Leu Pro Gm Gly iie Se_ s>=_ - 14Q 

130 13 = 

at ^ gc~ gt— gec 

tac tec aca ccc ccc cca gac gat cce age JJ»'» ^ V ai Gly 

^ Ser Thr Ser Ser Leu Asp Asp .eu Sar Glu As- -n igQ 

145 130 

«»= cc- aaa cac cct gtc gca 

aaa caa ccc cca cgc gac cat aac a gag cc ^ ^ Va] _ y&1 

Lys Gin Lau Leu Arg As? Tyr A**- -i 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 
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S76 



165 170 175 

atg gtt ccc ggt gte ate ccc acg gga act gaa age egg gga gte att 
Met Val Pro Gly Val He Ser Thr Gly He Glu Se_ Trp faxy ^ 
5 180 l Ss 

gga ga= gat gag tgc gat agt C=C gcg cat etc cgc aaa egg ccg tgg 624 
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Pne Arg ^ 
i95 200 

10 gga agt ttt tac atg =tg aga aca atg gtt atg gat aaa gtt tgt tgg 672 
Gly Ser Phe Tyr Met Leu Ar| Thr Mec Val Met asp i.ys 

15 ttg aaa cat gta atg tta gat cct gaa aca ggt ctg gac cca ccg aac 
Leu Lys Kis Val Met Leu Asp Pro Glu Thr Gly Leu As? -o ?ro a^ 
225 230 

ttt acg cca cgt gca gca cag ggc ttc gaa tea act gat tat ttc ate 
20 Phe Thr Leu Arg Ala Ala Gin Gly Piae Glu Ser -hr Asp iyr ^ 



720 



768 



245 



S IVy SE rS S S SS J" fp £ = - S K 55 S 

260 253 

ggc tac gaa ccc aat aaa atg acg age get gcg tat gat tgg agg ccc 
Glv Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp ~rg 
275 280 

gca tat tta gat eta gaa aga cgc gac agg tac ctt acg aag cca aag 
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Lys Leu uys 



815 



864 



290 " 2 » 5 



35 gaa caa ace gaa ccg ccc eat caa ttg agt ggc gaa aaa get cgc cca 
Glu Gin lie Glu Leu Phe His Gin Leu Sar Gly Glu Lys Va. cys 
305 310 313 

act gga cat ccc acg ggt tec cag act ace tec cac ccc acc aaa egg 
40 lie Gly His Ser Mec Gly Ser Gin lie jlj Phe Tyr Pns Me. Lys rrp 

gee gag gec gaa ggc ccc ccc cac ggc aac ggc ggc cgc ggc egg get 
Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly A-g cxy -rp 
45 340 343 

aac gaa cac aca gac cca ccc ate aac gca gca ggg acg cct ctg ggc 
Asr. Glu His lie Asp Ser Phe lie Asn Axa Ala Gly *h- -au L,e_ ^y 
355 350 

50 r.~* ee *- c-= a— act ccc gaa ace aaa gat ace 

IS S "I S ?ia » S; & Sir fey Glu Me= Lys «p Thr 



960 



1008 



1056 



1104 



37C 37: 



55 att caa tta aac acg. tta gec acg tat ggc ^5 S?aa aa* 

He Gin Leu Asr. Thr Leu Ala Mec Tyr Gxy i-au «lu Lys n. 



385 390 



60 



„ _ - - -s » sec ccrcr ggc ggc aca cca -cz 

aga att gag aga gta aa„ a,, ~~ -c. ?ro Ser 

Arg He Glu Arg Val Lys Mec ue.i «ln Tier .r? u./ u - 41 - 
405 



1200 



1248 
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atg ~- cca aag gga gaa gag £e «- W «. ■« « » £ 
Mec Leu Pro Lys Gly Glu Glu Val lie ^ « ' * 430 
420 425 
5 tea gag gat gca ttg aat aac aac act gac aca ta= ggc aat ttc .« 
Ser Glu Asp Ala Leu Asn Asn Asn Tbr Asp iw > * 
435 440 
10 cga ttt gaa agg aat acg age gat get ttc aac aaa aat ttg aca ^ 
Arg Phe Glu Arg Asa Thr Ser Asp Ala Pae as y 

450 4 = 5 
aaa gac gee att aac atg aca tta teg ata tea cct gaa egg etc caa 
15 Lys Asp Ala lie Asn Met Tar Leu Ser lie se_ -r 4jJQ 
465 470 

aga aga gta cat gag cag tac teg ttc ggc tat tec aag 
A?g Arg Val His Glu Gin Tyr Ser Pae Gly Tyr Ser L,y ^ 
20 ~ 4 8= * 3 

gag tta aga aaa aat gag eta cac cac aag cac tgg teg aat cca atg 
Glu Leu Arg Lys Asn Glu Leu His His Lys His 1 - 51Q 
500 50b 
25 gaa gca cca ctt cca gaa get eec cac atg aaa acc tat tgt ata tac 
Glu Val Pro Leu Pro Glu Ala Pro His Me. Lys xx g » g 
515 520 
30 ggg gcg aac aac cca act gaa agg gee tat gta tat aag gaa gag gat 
Gly Val Asn Asn Pro Tnr Glu Arg aj.^ ly± 54Q 

gac ccc tec get =tg~ aac tag acc aac gac tac g. age aag caa act 
35 Asp Ser Ser Ala Leu Asn Leu Tor lie Asp ryr w, 5g(J 
545 550 

gca tec ccc acc gag ggg gac gga acc gtt ecg ccc gcg gcg cat tea 
Val Phe Leu Tiir Glu Gly Asp Gly Thr Val Pro Leu v ^ 
40 5S5 57 u 

acg tgt cac aaa tgg gee cag ggc get tea ecg tac aac cet gee gga 

Mec Cys His Lys Trp Ala Gin Gly Ala se *v 59Q 

580 533 

45 s = s s s a e s s hK a s g a s s 

595 5Q 

^ ffaa e*"- gta gac ate etc ggc age 

a-a cgt gg^ gga gca aaa age gee gaa c~ g ^ Leu Ql Ser 

lie Arg Gly Gly Ala Lys Ser Al* Glu .-is vai jsy 



1296 
1344 



1392 
1440 
1488 
1536 



1632 



1680 



1728 



1776 



50 



5d 



1872 



1920 



„ at- cca age ggt aat ggc gat 

rcg aag t-g aac gat tac a^w w- 3 - G ly Asn Gly Asp 

111 Glu Leu Asr. Asp Tyr lie ueu Lys -la Al- Se. y ^ 

525 « 30 

aat t" age cag tgg gtt tct cag 1968 

zzc gtc gag cca cgc caa ^g ^ £ - ^ Gin Trp Vai ser Gin 

Leu Val Glu Pre Arg Gin i»eu a — 633 



60 54 = 
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10 



1986 

atg ccc ttc cca atg taa 
Met Pro Phe Pro Met 
660 

<210> 5 fit, 
<211> 661 
<212> P5.T 

<2I3> S ac char omy ess cerevxsiae 



Me,°Gly Thr Leu Phe Arg Arg Asn Vai Gin Asn Glr. Lys Ser Asp Ser 

15 Aso Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 

20 25 
Asn His lie His His Gin Gin Gly Lau Gly His Lys Arg Arg Arg Gly 

20 lie Ser Gly Ser Ala Lys Arg Asn Glu Arc Gly Lys Asp Phe Asp Arg 
50 55 
Lvs Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 
25 65 70 75 

He Phe He Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 
85 90 
30 Glv Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe 
100 105 
Val Asn Phe Asp Ser Leu Lys val Tyr Leu Asp Asp Trp Lys Asp Val 
115 120 
35 Leu Pro Gin Gly He Ser Ser Phe He Asp Asp He Gin Ale Gly Asn 
130 135 I 40 

Tyr Ser Thr -Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
40 145 150 153 

Lys Gin Leu Leu Arg Asp Tyr Asn He Glu Ala Lys His Pro Val Val 

45 Met: Val Pro Gly Val He Ser Thr Gly He Glu Ser Trp Gly Vai He 
180 135 

Glv Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 
^95 200 

50 Glv ser Phe Tyr Me,-Leu Arg Thr Me, Val Me, Asp Lys Vai Cys Trp 
" 210 215 ^ 20 

Leu Lvs His Vai Me, Leu Asp Pro Glu Thr Gly Leu Asp Pro Pre Asn 
55 225 230 

„. _ . . -,_ r i_ G -, v 5n» ciu Ser Thr As? Tyr Phe He 

Phe Thr Leu Arc AJ.a Ala Gi^ 255 

245 • "° 

60 Ala Gly Tyr Trp He Trp Asn Lys Vai Phe Glr. Asn Leu Gly Vai He 
1 r r\ 2oz 



250 253 
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Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 280 285 

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 
290 " 295 300 

Glu Gin He Glu Leu Phe His Gin Leu Ser Gly Glu Lys Val Cys Leu 
305 310 315 320 

He Gly His Ser Met Gly Ser Gin lie He Phe Tyr Phe Met Lys Trp 
325 330 335 

Val Glu Ala Glu Glv Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val 
340 " 345 350 

Asn Glu His He Asp Ser Phe lie Asn Ala Ala Gly Thr Leu Leu Gly 
355 " 360 365 

Ala Pro Lys Ala Val Pro Ala Leu He Ser Gly Glu Met Lys Asp Thr 
370 375 380 

He Gin Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser 
385 390 395 400 

Arg He Glu Arg Val Lvs Met Leu Gin Thr Trp Gly Gly He Pro Ser 
405 " 410 415 

Met Leu Pro Lys Gly Glu Glu Val He Trp Gly Asp Met Lys Ser Ser 
420 425 430 

Ser Glu Aso Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe He 
435 440 445 

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met 
450 " 455 460 

Lvs Aso Ala He Asn Met Thr Leu Ser He Ser Pro Glu Trp Leu Gin 
465 " 470 475 480 

Ara Arc Val His Glu Gin Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 
435 490 495 

Glu Leu Ara Lvs Asn Glu Leu His His Lys His Trp Ser Asn Pro Met 
" 500 505 510 

G 1 " val °ro La" »ro Glu Ala Pro His Met Lys He Tyr Cys He Tyr 
515 520 525 

Giv Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 
530 535 540 

aso 3e- n» r.* u is- Leu Thr He Asp Tyr Glu Ser- Lys Gin Pro 

545 ~ * 550 =55 560 

V»i ©h- Le<- ^hr G'u G"v Aso Giv Thr Vai Pre Leu Val Ala His Ser 
565 " " 570 575 

Me- cvs - J <s Lvs f"-a Al= Gin Giv Ala Ser Pro Tyr Asn Pro Ala Gly 
" *" 580 " 585 590 
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lie Asn Val Tnr He Val Glu Met Lys His Gin Pro Asp Arg Phe Asp 



595 600 
lie Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp tie Leu Gly Ser 
5 610 615 

Ala Glu Leu Asn Asp Tyr Xle Leu Lys He Ala Ser Gly Asn Gly Asp 
625 



630 6" 



10 Leu Val Glu Pro Arc Gin Leu Ser Asn Leu Ser Gin Trp Val Ser Gin 

645 650 

Met Pro Phe Pro Met 
660 

15 
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stQtizucs: LISTING 

<110> Sryrwis Dr., Seen 
<120> 



<130> 

< 4.*U> 

c!50> 



<210> 1 

<211> I98S 

<212v eenomic DN7V* 

<2I2> Saccharomycss cerevisiae 

c220> 

<221> CDS 

<222* CI) . . (13B2) 



<< > QC> 1 >.~ 3 r cia aaa agt cac =cc 4Q 

ss= =- s T = r *« HI oln w ; _ w s -„- 

Met Gly Th.r Leu Pne Arn Ac 3 as.. , _ 

■ to X3 



5 10 



- M - ect etf cac aac aa= =3* 5 a= age 

sat eaa aac aa: aaa =22 25- ccc S" -* . - 

" _ • rW 0 „„. val His Asr. Lys Arg Glu Sc. 

As- Git ?jr. Asr. L'/= Cly G1 y v 30 



35 



20 25 



aac cac act cat cat caa cac gga eta S g= cat aa S aga aga agg ggt !~ 
His xi. Hi. His cm Gin Gly Lau Gly K l£ Ly S A--g Ar = iu« .ol. 

Art ^ 3 



23 « 



e-«- occ aaa aa: tec gac age i92 

.w- , a ~*- crca aaa aga aat ga = e 9 . Ss- * 

a.w a = - s = - a,,- gc - phft As? Arg 

-Is c-.- Glv c sr Lys Arc Am Gl- a-s 

~ 50 
50 5 ~ 

_ .„ „^ ta3 sea gat ccc aga aca ctg 240 
aaa aca gac ggg aac gge aga —a c — - ; Leu 

Lys ?.rg as? Gly Asn Gly Arg Ly» T-? - ^ 

70 13 



_ e „ ccc etc age "t 2S3 

x-: S; II: SI £ «i; ~ «» «■ «- s ~ «« 



3! 



a - a ~ -rac teg tt: cat aac 33S 

ccc c= = tat cat 2-= «- aa " * ^ " j Utt Phs As? M= 

Glv ^.la Tyr Kic Vai , 10 



100 
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c.a a*C Ctt S ac tea e.c .a, S e S Wl S~ *« »« ^ 

Val Asn **e Asp Scr Leu Lys Val Tyr Leu Ae? M? rrp Lys «, 



us 120 



• . „-.. t-c ac: cat car- act «g SS* «= 

eee c=a caa a S z aca a S t « s c.c ciy 

Lcu.Pro Gin Gly lie Ser Scr S« lie Asp Mp 



130 " S 



~ e -ec «• S a= eat «c agt. =aa aae C« ecc e>C S „t 

tac e== aca tec wee cc- S a - xla Val Gly 

Tyr ser Tivr Ser Ser Leu As? A,? ^ S,- Giu ^ 

145 3- so 

„_ ea - sec aas. ca: 

aaa caa c c «a e S t S ac » «c a ... p== ^ Val 

Lvs Gin Leu Leu Ar= Asp Tyr Asr. -le C.u a , 7S 

" "7 G — 



3S4 



432 



52 O 



its * 7C 



« fl sc= ccc 6 gt =tc atttcc acg s=- 9 - 
Me-, val Pre Gly Val lie Ser Thr Gly al. Glu . 

-IBS - 9 W 



130 



- =or t-r- sea ear cat aaa ere ctg egg 

s , a 5 a= 5 a S - S = =- J-« Lys ^ Lau w 

Gly AS? Asp Glu Cys AS? Se- Se. A-a 



57S 



S24 



135 200 



eta aC a at= g« atg gat aaa err r S _ -g= 

== a agi car ate eta a«a -C- S ^ cys ^ 

Giv ser Pb.e T-yr Met Leu Arg TO- Ke_ Va- «e - 



210 2i5 



S72 

l,ys Val cys Tr? 

220 

720 



. _. = cct «a aca «t =«=S S« — = 

aaa car get a.-., .-a = - ,„ pra ^ 

I.eu Lys His Val Mar Leu As? 5— C _. Q 



225 230 



.=c «. c S -. ,~ s « ». ss = ----- — :;- "'; *U 

.fc. XT.: Leu te. Hi ^" =^ 5i = " = ™ ? * 

« . 2 S 0 

24.3 



„ — ess. aat ctg gga- a ~ 

era SH = « c J" olT . - uS u Gly Val XI. 

Ala Gly Tyr Tr? lie Tr? Asn x,ys V~ — 2?o 



2S0 255 



i2 .r eat :== are = - - 

^ c = c a„ - z r ::: ^ r l ? ^ ^ 

Sly ryr C-iu Pre A S r. Lys Met Tr.r ^e.. — — - JJS 



752 



515 



= 54 
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„.n •« cc~ ea: sec tac ttt ae 3 aag eta aag 
gca cat tea gat eta gaa -9* eg ga S ^ ^ Ly= 

Ala Tyr Leu Asp Leu Clu Arg A. 5 200 



912 



290 295 



„ eaa c£ . tt - cat caa ttg agt age g*a aaa get tgt tta SSO 
gaa caa ate «aa et, tt. ^ yal Cy8 ^ . 

Glu Gin He Glu Leu Phe Kla Gin uau ^jr ^ 



303 310 



- MO act »r tec tac ttt acg aaa egg 
aec gga eat cec at S SSt ce- « - ^ 

He Gly Kia S« Met Gly sar Gin He lie M. y ^ 
32S 



wu aa" gg= egC e=t ggc egg sec 

gee gag get gaa ggc eet ett tac gg- «. gg - ^ 
vai Glu Ala Glu Gly Pro Leu Tyr Gly A-n Gly Gly ^ 



310 



. cc- at- eat cca gca ggg aec etc eeg g = - 

aac gaa cac ata gac wca cc- ^- - Lftu flly 

X=n Glu Hie He A=P Ser Phe Ha Aar. ~la Ala y 



33 = 



=ea c-e eta ntt agt ggt gaa acg aaa gat aec 

gee c=a aac gca S~ «a =- ^ M£C fcy , As? Thr 

Ala Pro Lys Ala Val Pro ax- * 3g0 
370 373 

B - ca «- ccc tat gge et S gaa aag tec ttc eea 

act caa eea *ac a= 5 gc ^ p . fte Ser 

lie Gin Leu Asn Thr Leu Ala Me. Tyr ^ 400 



xooa 



10Su 



1104 



1152 



1200 



3SS 390 



c?— t co: a:a cca tea 

-* "« « - £ =" S £ Sy Sr n. - '« 

Xro Zle Glu Arc Val Ly- We c - r 4 13 



405 



- ce .a caa cag cec att egg =5= S** ^ ia = tC& 
acg cca cca aag gga gaa g-, Ma= Lyg Sar Ser 

Mat Leu Pro Lys Gly Glu Glu Val Cr? c . 



425 

420 



- aae u - a— gac aca tac ggc aat ttc act 
tea gag gat gca ee= aa, aac aa_ a-- g ^ 
Ssr Glu Aap Ala Lau Aat Arr. Asr. Thr Asp Th_ Tyr y 



43 5 440 



_ a -== aac aaa aat teg aca a„g 

cca ttt gaa agg aat ae = a = . -~ L-u _„ Mst 

Arc Phe Glu Arc Arr- Thr Sar Asr Ala ?n- ^ Y 



1296 



13S2 



430 * SS 



. — - a-a "a c^ ff« ccr etc caa 1440 
aai gac ccc >:: " = ac ~ o D ro .Qlu Trr Leu Gin 

r, V5 xsp "* sn v 75 490 



470 
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aga aga ota cat gag cag tac teg tic gee cac ccc aag aac gaa gaa 14SG 
Arg Arc Val ?.i s Glu Gin Tyr Ser Phe Gly Tyr Ssr Lys f.sr. Giu Glu 
485 4 SC 455 

eae cca aga aaa aac gag eta cac cac aag cae tgs ccg aat cca acg 1S3S 
Glu Leu Are Lys Asa Glu Leu Kis Hi: Lys His Tr? Ser Asr. Pro Met 
" £00 50H "0 

„a eta e=a ctt cca gaa get ccc cac ate aaa acc tat t=t ata tac 1534 
Glu val Pro Leu Pro Glu Ala Pro Kis Ms: Lys lie Tyr Cys He Tyr 
£15 S20 S2S 

eae cce aae aae eea act gaa. agg g=* S" «t aag gaa gag gat 1G22 

Gly Val Ast. Asa Pro Thr Glu Arg Ala Tyr val Tyr Lys Glu Glu Asp 
£30 S3 S S40 



cae tec ccc get ctg aat ttg ace ate gac cae gaa age aag caa ccc 
L? scr Ser Ala Leu Asn Leu Thr Tie As? Tyr Glu Ser Lys Gin Pre 
£45 S30 SSS SS0 

eta ccc cte acc gag ggg gac gga ace gtt ceg etc gtg gcg cat tea 
val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser 



£70 57S 



ate tec cac aaa egg gec cac get eec cca ccc cac aac est gec g=a 
Met evs Kis Lys Tr-e Ala Glu Gly Ala Ser Pro Tyr Asa Pro Ala Gly 

sso ' s« 590 

att aac etc act att gtg gaa atg aaa cae cag cca gat cga ctt gat 
He Asa val Thr He Val Glu Mat Lys :-:is Gin Pro As? Arg Phe As? 
= 5H 600 £0 = 

ata cgt =gc eea eca aaa age gee gaa eae gta gac ate cte gge age 
He Arg Gly Gly Ala Lys Ser Ala Glu Kis Val Asp He Leu Gly Ser 
610 SIS S20 

ccc oae tte aae aat tae ate ttg aaa att gea age get aat ggc eat 
Ala Glu Leu Asa Aso Tyr He Leu Lys He Ala Ser Gly Asa Gly As? 
<S2 = S30 S35 S4 ° 

etc etc cac cca cgc cat ttg ccc aat tee age cae tgg get act cae 
Leu. Val Glu Pre Arg Gla Leu Ser Asa Leu Ssr Gin Tr? Val Ser Gin 



IffSO 



172.8 



1776 



1824 



1872 



1520 



ISSi 



S45 



£50 e =- 
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<210» 2I7 
<211> 661 
<212> PRT 

<212> saccharomycsr cerevisiaa 



Asr Val Gin Gin Lys Sar Ac? * Ser 

Met Gly Thr Leu Pna Arg Ar 3 wr. va. ^ 

c X 0 

1 3 « - v-1 vis Asn Lys Are Glu Ssr Ar= 

Asp Glu A=r. Asn Lys Gly Gly Ser val xiis Y ^ 

Xlc kE His Gin Gin Gly Leu cly Kls x*. A--, *r S « Gly 
Xlc ser Gly Ser Ala Lys ~S « Olu ~ S Cly A s? ft." A-P — 

Lys a" Ar? Gly Asr. Gly a" LV* « «P ~* «~ **» ~ 3 ~ 

65 . «^ t nv Val Leu Lau Pro Phe Ser Phe 

lie Phe He Leu Gly Ala Phe Leu Gly val Le ^ 

35 90 
.1, ,yr tt. «1 — «P s « "» L " "0 "* 

v,, r.: s« « w. - £ M u w »j « 

XiS . tu! Tl- AS= AS= Xle Gin Ala Gly Ass 
Leu Pro Glr. Gly XX* Ser Ser Phe Xl ? - ^ 

130 " 5 i~u Ser Glu Asr. Phe Ala Val Gly 

Tyr Ser Thr Ser Ser Leu Asp A-P Leu S- ^ 

£ «. «. ,.u « «. " ; s15 ~ ™ v " 

K . s v., ,.-0 o» "I tx. - - «• «- ~ w lii 

130 m se - tie Kis Phe Ar= Lys Ar= Lau Tr? 

Gly As? Asp Glu Cys Asp Ser Se. Ala ^ 

~' 5 v .i m«- Asr Lys Val Cys Trp 

Gly Ser Phe Tyr «e-. Leu Arc Thr MeC Va- * 

210 v t»r-« <r>u Thr Giv Lau As? ?rc ?ra Asn 
Leu Lys His Val Met Leu Asp Pro G.u ^ ^ 

225 "° - w Giu Ser Thr AS? Tyr Phe Xle 

Phe thr Leu Arc Ala Ala Glr. Gly ?— 2£ _ 

243 , v - v.' 'he Glr. I-eu Gly Val Xle 

Ala Gly Tyr Tr? Xle Tr? Asr. Lys v»* .n. ^ 

2GC ?U Ala Tyr As? Tr? Ar= Leu 
Gly Tyr Glu Pre Asr. Lys Mas Thr Se- r. 1 J|g 

275 230 _ r-..- ? h= T hr Lys Leu Lys 

Ala Tyr Lau A2? Leu Glu Ars Xrg As? -/- 

250 295 e1v G -.j Lvs Val C%-= Leu 

Glu Glr. Xle Glu Leu Phe Sis Glr. Leu Se. G_, <~- - 



21= 

3 0= 
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~, c— nm He He Phe siie KcC LyS Trp 

life Glv His Ssr Met Gly Se= <=1* - ie r 335 

v., ol 'u M . ou. =" - «v «* " s TZ Ttp val 

M „ GVo Ki= xt°. M p .« ». - - «r ~ - «-» 

335 „, !!m ii- ser Gly Glu Mac Lys Asp 

Ala Pre Lys Ala val Pro Ala *•« rl - ser y 3go 

370 375 rw Lau Glu Lys Phe Phe Ser 

lie Gin Leu Asa Thr L*u Ala Mac Tyr Gly Leu ax y „„ 

Z Xle Clu Ar S val Lys - - Gin Thr Trp 1y .XV «. £ •« 

H.c Leu Pro Lys 4 G°iy Glu Glu val He Trp Oly Asp ^ Lys - - 

420 til » ao Thr Tyr Gly A=n Phe lie 

Ser Glu Aep Ala Leu Asn Asr. Asn Thr Asp Th- xy 

435 , Sa - Tap Ala Phe Asn Lye Asn Leu Thr K»t 

Arc Phe Glu Arg Asn Thr 9 a- Asp w ^ 

«50 455 T ,» «,»- pro Glu Trp Leu. Gin 

Lye Asp Ala He Asn Me, Thr Leu Ser He ... 

* ss 470 BV . „lv TV- Ser Lya Asn Glu Glu 

Ar = Val Kis Glu Gin Tyr Ser Phe Gly Ty- V ^ 

495 ^ t.u Hie y* s lys Kis Trp ser X« Pro Met 

Glu Lsu Arc Lys Asn Glu Leu Kis I * slo 

5 05 

300 » ~lu Ala Pro Ki= «et Lys He Tyr Cys He Tyr 
«1t, val Pre Leu Pro -lu Ala Pro ^ ^ 

Gly val Acn Asn Pre Thr Glu Ar 3 Ala Tyr Val Tyr 

330 !" ^r- aso Tyr Glu ser Lys Gin Pro 

Asp Ser Ser Ale Leu Asa Leu Thr He Asp tjt ssQ 

s * s SS ° „, , TH . val *ro Leu Val Ala Kis Ser 

Val Phe Leu Thr Glu Gly Aa? Gly Thr Val ,-o ^ 

I" m - C-1- Civ Ala ser Pro Tyr Asn Pro Ala Gly 
Met cys Kis Lys Trp Al~ C-l*. exy sg0 

S8 ° , „, w. ««s cm Pro Asp Arg Phe As? 

He Asr. val Thr He Val Glu Met. Lys — ffQS 

32 S e - all Glu Kis val Asp H« T - eu Gl V Scr 

He Arc Gly Gly Ala Lys Ser Ala Glu K. ^ 

S10 613 . Al£ c. r Gly Asn Gly Asp 

Ala Glu Leu Asa Asp Tyr He *-u L/- Xl- fi4Q 

S23 £2 ° x c„- G i r rro val Ser Gin 

^ t.-u Ser Asn Leu s— < — »- - 

Leu Val Glu Pro Ar= ^ 1 SS5 



= 30 

54 = 



ms: Pre Phe Pro Ka: 
650 
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<21C> 3 ix 
<211> 2312 
<212> csnoric DNA 

<213> Schizosaccharomyces .pombe 
c;oo> 3 

ATGGCGTCTT CCAAGAAGAG CAAAACTCAT AAGAAAAA3A AAGAAGTCAA SO 

ATCTCCTATC GACTTACCAA ATTCAAAGAA ACCAACTCGC GCTTTGAGTG 10 0 

AGCAACCTTC AGCGTCCGAA ACACAATCTG TTTCAAATAA ATCAAGAAAA 15C 

TCTAAATTTG GAAAAAGATT GAATTTTATA TTGGGCGCTA TTTTGGGAAT 2 00 

ATCCGGTGCT TTTTTTTTCG CTGTT GGAGA CGACAATGCT GTTTTCGACC 25 0 

: CTGCTACGTT AGATAAATTT GGGAATATGC TAGGCTCTTC AGACTTGTTT 3C0 

GATCACATTA AAGGATATTT ATCTTATAAC GTGTTTAAGG ATGCACCTTT 33 0 

TACTACGGAC AAGCCTTCGC AG T CT C CT AG CGGAAATGAA GTTCAAGTTG 400 

GTCTTGATAT GT AC AATG AG GGATATCGAA GTGACCATCC XGTTATTATG 45 0 

GTTCCTGGTG- TTATCAGCTC AGGATTAGAA AGTTGGTCGT TTAATAATTG 50C 

CTCGATTCCT TACTTTAGGA AACGTCTTTG GGGTAGCTGG TCTATGCTGA £5 0 

AGGCAATGTT CCTTGACAAG CAATGCTGGC TTGAACATTT AATGCTTGAT £00 

AAAAAAA CCG GCTTGGATCC G AAG GG AATT AAG CTGCG AG CAGCTCAGGG 650 

GTTTGAAGCA GCTGATTTTT TTATCACGGG CTATTGGATT TGGAGTAAAG 7 00 

TAATTGAAAA CCTTGCTGCA ATTGGTTATG AGCCTAATAA CATGTTAAGT 7SQ 
GCTTCTTACG ATTGG CGGTT ATCATATGCA AATTTAGAGG AACGTGATAA S0 0 

ATA" x X T re A AAGTTAAAAA TGTTCATTGA GT AC AG C AAC ATTGTACATA 8 50 

AGAAAAAGGT AGTGTTGATT TCTCACTCCA TGGGTTCACA GGTTACGTAC 9 00 

•TATTTXTTTA AGTGGGTTGA ACTJVLVAGGGC TACGGAAATG GTGGACCGAC OSO 

TTGGGTTAAT GATCATATTG AAGCATTTAT AAATGTGAGT CTCGATGGTT 10 00 

GTTTGACTAC GTTTCTAACT TTTGAATAGA TATCGGGATC TTTGATTGGA 10 5 0 

CCACCCAAAA CAGTGGCAGC GCTTTTATCG GGTGAAATGA AAGATACAGG 110 0 

TATTGTAATT ACATTAAACA TGTTAATATT TAATTTTTGC TAACCGTTT7 11S0 

AAG CT C AA.TT GAATCAGXTT TCGGTCTATG GGTAAGCAAT AAATTGTTGA 12 00 
GATrttiTT&C TAATTTACTG TTTAGTTCGC AAAAATTTTT TTCCCCTTCT 
GAGGTATATT CAAAAATACA AATGTGCTC? ACTTTTTCTA ACTTTTAATA 
CAGAGCCATG ATGGTTCGCA CTATGGGAGG AGTTAGTT CT ATGCTTCCTA 13 50 

AAGGAGG CG A TGTTGTATGG GGAAATGCCA * GTTGGGTAAG AAATATGTQC 140 0 
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TGTTAATTTT TTATTAA.TAT TTAGGCTCCA G ATG AT CTT A ATCAAACAAA 
TTTTTCCAAT GGTGCAATTA TTCG AT AT AC AGAAGACATT GATAAGGACC 
ACGATGAATT TG ACATAGAT GATGCATTAC AATTTTTAAA AAATGTTACA 

^.vs-cxt GCTAGCGAAA AATTATTCCC ACGGTCTTGC 
GATGACGATT TTAAAG*C«T G 

, tkaaJAXTAA CGAAATGCCG TCTAAATGGA 

TTGGACTGAA AAAGAAGTGT TAAAAAATA*. 

— ,.itr-» M ' ACATTAAAGT TACTAAATTA TACTAACCCA 
TAAATCCGCT AGAAGt AAO-- fcCMiww 

WM XCX«, XC^C=TT« =CX==TC A T* TTGCSTTCR.C 
CM«TC«AA 0«>A==«« M*M=C««X 

rAA TGATGGAACA AAAGTTGAAA 

GGGGCAACCT GT C ATTG ATT CCTU3GTTAA raw 

m m „ _M*iM< r-r-TAACTGTT TTATTAGGGT 
ATGTGAGAGA ATTTATGTTT CAAACATTCT -T^Ci. 

AJLC^TTA CCAATATTAG CCCTTGGTTT 

ATTGTTATGG ATGATGGTGA xw^AACTXT~ 

cctctcc^t amotttmc =~™«cr ==T«T«« 

««««» TT«C^T= «««« C««aC«T. 

„„, ga/** s T A.CTTGGAC ATTCAG AG CT 

GGAGGACCTC GCTCGGCAGA ACACvr.wGA- 

«X TC ^T T=^«C TTACfcAATTT CTCTTO»« 
TTTT^T ™,t M =C C«^CZ 0»«» 
C*m*** TCMKKNe ««««« 
TOMTAACT* .aMJAATM. »T»<>»*» TCT«««* 

CCT AG AAATT AA 



1450 
15Q0 
1S50 

1600 

IS SO 

1700 

17S0 
1300 

iaso 

1900 

1950 

2000 

2050 

2100 

2150 

2200 

22S0 

2300 

2312 
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<2iQ> */r 

<212> genomic DNX . . ary ^ 
<400> 4 

TTCATCOO^ W«CC=«0 

0 x= STC =- »-«» — — 
CTO c^= — ° — ^ ST ^°: G ; 

„=™ ™ «. ««— «« ««— * 

>XAGAAGGTG TTAAGGCGAA ACATCCTGTT G^CA 

c^cacxcco CTCGAGCTTT gggaagg c aa ac^ccgcx g^ttta 
^ GSAAC rrrrG gtgaagtcta caaaaggtga 

VrAGAAAACG TTTGTGGCGT GGAACTTTI 

rrTT - ATWT GGGATTTGGA TTGGATCTGA 
CCTCAACAAT TCTCACTCTT CCTTTATAa - 

„_,,,-,. tCACTCAAAC TTTA.--TTCCA 

TGAGATCACG CACTTGTTGC TTCTTCP 



ATTTTCVTAA ^ - _ a G ~ TA TAAAA 

-_ GTTXC x~ GTTAACT.TTA GoA^T^w 
GTAATTGGCT TGGACTATTT CTGTT^G- 

TAGCTGCCTT GGAATTTCAA GTCAw.^ 

n^r-r-^ TTCCTTTAC- GTCGTTGCG j. 

ATTGCCCTAGA GTCCGTXCAT AACAAGTTA- 

™^ » <— rftGTGTTT TATGTTTTGT 

otmmti*. crrr^T^ c=x««.ra„ 

™. CT-CATCTST OOAW.OTaTG ' 

«»«««» . raTAIS , ««««« 

ctmttcctt modern: c«=t=m.t 0 tgcittt - 

' „ „ v C =M~T f .TT C*CATOT=TT OMTtlC... 

TC^TTTC^ . _ CCTCCC3XC C 

CXTCCTCSTT CT3T7A.TCCA OCtTW-*- . „.. 

™« — r: ; ^« 

, CAGTGSTATA S*w^ 

^m^ci en-Mr* «-»'-*• »- 

_ g .~cmctg «x«««* xca*c»- 



50 

100 

150 

200 

250 

300 

350 

400 

450 

500 

550 

goo 



XCTTQii^ * 

rrTCT - -~- r^TCTT TACTTTTTTT TTTTTTTGAT GTGA^CGCX 
XGTTTGTCTG r.TTACT TTCCAAGGAC 



700 
750 
500 
£50 
900 
350 
1000 
1050 
1100 
1150 
1200 
1230 
1300 
1230 
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XCAG0ACTCC3 XGGCTCCTGA CT ACXTTG CT CCXGGCTACT XTGXC 7 G=GC 
^GCTGAXT GCXAACCXXG CACATAXXGG • AXAXGAAGAG ftAAAAXATGX 
AC^GGCXGC ATAXGACXGG CGGCXXXCGX HC^C AGAGGXXCXX 
TTCTCAXCGX XCXXXCXAXX ATXCXGXXCC AXGXXACGXX TCXXXCXXCA 
XTACTTAAGG CXXAAATAXG TXXCATGXXG AA?T AAX AGG 
CACTCTTAGC C*XATGAAAA — , CXTGAXGGXX TCXACCAACG 

gTT CCGC AXX CCATGGGCGT CTTGXAXXXT 
GXGGAAAAAA AGCAGTXATA GXTCCGwaa i, 

_ ^-. r - CAC CX GCTCCTCTGC GXGGCGGGGG 
CTACAXXXXA TGAAGTGGCT T-AG^CACCA O*-* 

, ™==~ »«•— 

c^rrTcr — m ctctgctgaa 

cc««=«= AT " xTtT °: 

-XVCTCAAAG XTAXXCTACT AAATAXCAAT 
TGATCAGAAC CTTGGCTCTG G-ACXUW*. 

_. T _ TT A TCGCTGCAA CTGACAXTGG TTGAXXAXXT 

TCTAATAACA TTGCTM^i 

XGTAACXGAA ACXCTCXXGA GATTAGACAA AXGAXGAAX* 
GATAATTCTT ACGCAXXGCT cxgxgatgac.cgxxxcxxa gcxxcgacga 

__, JTCT TXXGGAGGGC ATTGAATTTT GCTAXG-AAA 

TAA.CAXXXGX CATACXwTCT ITT^ 

~^-G-XT GCAXTCXXXA CCAATTAGCG XTAXXCTGCX 
GCGCTGGAGC TTCCAi.GwTT 

_ TGCATCTATG GXCXXXTAXT TCTTCTTAAT 

TCTTTCAATT TXCTTGXAXA iGCMdA^ 

— * GTCACTTGCST «^~- 

^CTXT« XTTCTXC^ «««« 
X^,CT ««« «~ 

=— — * °= T ™ 
XATGGGGCGG GCTTGAXXGG TCACCGGAGA A~GG~~ CAC 

„,,„, raAGCAJSOTG XAAACGGAC7I 

AAAAAGCAAA AGAACAACGA AACTTGXG^ GAAGCAG 

ACTAXGGAAG GAXGAXATCT XXTGGGAAAG 
TTCCAAGAAA MTCCTu^A ACTATt»~ 

„ AXAATAXTGA TTTTCGAG * A 

AAGTftGCAGA CGCTGCGCCA TCT-nS-w*- Ax. 

ATTTTGTGAT TGXAXG«X«~ 

ASG AC AX ATA AATCATAATA AAC-.Tw-X— 

„„: ****** — 

, ,„ s GAGXACCATG ACAXG— — 

CCSAATCACA CCTGTCGTGA CSXGX—^ 

r ~,- TX TAA GGXCTACACT GCXGGTGAnG 
7GCTGGGAXC AAACCXAXCC C 1Un »^ 



25S0 

2700 

2750 

2300 
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CTATAGATCT ACTACATTAT GTTGCTCCTA AGATGATGGC GCGTGGTGCC 
GCTCATTTCT CTTATGGAAT TGCTGATGAT TTGGATGACA CCAAGTATCA 
AGATCCCAAA T ACTG GT CAA ATCCGTTAGA GAGAAAGTAA GTGATTTCTT 
GATTCCAACT GTATC CTTCG TCCTGATGCA TTATCAGTGT TTTTGTTTTC 
GGTCTTGTTG GATATGGTTT TCAGCTCAAA GCTTACAAAG CTGTT7CTGA 
GCCTTTCTCA AAAAGGCTTG CTCAGTAATA TTGAGGTGCT AAAGTTGATA 
CATGTGACTC TTGCTTATAA ATCCTCCGTT TGGTTTGTTC TGCTTTTTCA 
GATTACCGAA TGCTCCTGAG ATGG AA ATCT* ACTCATTATA CGGAGTGGGG 

„ r „ 1T , cGTATACAAG CTTAACCACT CTCCCGACAG 
ATACCAACGG AACGAGCATA CCTATACWw 

' ' XTGCATCCCC TTTC AG AT AT TCACTTCTGC TCACGAGGAG GACGAAGATA 
GCTGTCTGAA AGCAGGAGTT TACAATGTGG ATGGGGATGA AA=A=TA==C 
GTCCTAAGTG CCGGGTAGAT GTGTGCAAAA GCGTGGCGTG GCAAGACAAG 
ATTCAACCCT TCCGGAATCA ACACTTATAT AAGAGAATAC AATCACTCTC 
GGCCGGCTAA CCTGTTGGAA GGGCGCGGGA GGCAGAGTGG TGCCCATGTT 
GATATCATGG GAAACTTTGC TTTGATCGAA GATATCATGA GGGTTGCCGC 
CGGAGGT AAC GGGTCTGATA TAGGACATGA CCAGGTCCAC TCTGGGATAT 
TTGAATC-GTC GGAGCGTATT GACCTGAAGC TGTGA 



3900 

2950 

3000 

30SO 

3100 

31S0 

3300 

3250 

3200 

33S0 

3400 

3450 

3500 

3550 

2SG0 
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<210> 5^ 
<211> 402 
<212> CDMA 

<213> Arabidopais chaliaaa 
<220> 
<221> CDS 

•<222> C120) . . (402) 

<4oa> s 

AGAAJ\CAGCTCTTTGTCTCT CTCGACTGATCTAACi^TCC CTAATCTGTGTTCTAAATTC 

enucawTTiu^ tccmtat^cttaacctggt ttaatttcxagtgacagat 

_ n — — r *j5 JLPlA CCA TCG ACG CCG 

ss ss es s si ^ s: « s ss ^ » »» « 
e is si iz ss - k ss - ss s s~ s: £ - 

XCC «» T=C CA= OT »» «« *- « -* ~ « ~ 
Glu Sfir Ser Lys Ssr his His Lys 7ff 

-T—r-p AT ? GGG TGT GTG TGT GTA ACC 
IS SS 55 sS SS™ S ?5 ly C3 V.X Cy , V.1 «ur 

tgg tgg ttt err crc ttc ctt tac £c gca atg cct gcg aoc ttc cct 

Tr? Tro Pr.e Leu Leu Phe Leu Tyr Asn Ale. Mat »r 
C M » =TX *=G 3,C CCS »T CA= =KC T=C TTT CCC XT* OCC . 
• Gin Tyr Val Thr Glu Pro Asr. His ??? ser ~ 



60 

119 

1(S7 

215 

• 311 
259 
402 
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<2io> sir 

<211> 643 

<232» CDMA 

<213» 2eft mays 

<220> 

<221> COS 

<222> (1) . . (402) 

<4Q0> 6 

CGG GAG AA* ATA GCT GCT TTG AAG GGG GGT GTT TAC TTA GCC GAT GGT 4 8 
Airs Glu Lyc lie Ala Ala Leu Lys Oly Gly Val Tyr -Leu Ala Asp Gly 
I Y ' 5 10 15 

GAT GAA ACT GTT CCA GTT CTT AGT GCC GGC TAC ATG TGT GCG AAA GGA S6 
As^ Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Mez Cys Ala. Lys Gly 
20 25 30 

TGG CGT GGC AAA ACT CCT TTC AGC CCT GCC GGC AGC AAG ACT TAC GTG 144 
Tra Arc Gly Lys Thr Arg Phe Ssr Pre Ala Gly Ser Lys Thr Tyr Val 
35 40 « 

AGA GAA TAC AGC CAT TCG CCA CCC TCT ACT CTC CTG GAA GGC AGG GGC 192 
Ar = Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly 
SO S5 SO 

ACC CAG AGC GGT GCA CAT GTT GAT ATA ATG GGG AAC TTT GCT CTA ATT 2 40 
Thr Gin Ser Gly Ala His Val As? lie Met Gly Am Phe Ala Leu lie 
65 70 75 SO 

GAG GAC GTC ATC AGA ATA GCT GCT GGC GCA ACC GGT GAG GAA ATT GGT 2SS 
Glu Asd Val Tie Arc lie Ala Ala Gly Ala Thr Gly Glu Glu He Gly 

.85 20 95 

GGC G£T CAG GTT TAT TCA GAT ATA TTC AAG TGG TCA GAG AAA ATC AAA 3 36 
Glv Asn Gin Val Tyr Ser Asp Xle ?hs Lys Trp Ser Glu Lys Xle Lys 
100 105 HO 

TTG AAA TTG TAA CCTATGOGAA GTTAAAGAAG TGCCG ACCCO TTTATTGCGTTCC 2 21 

Leu Lys Leu 
115 

AAAGTGTCCT GCCTGAGTGC AACT CTG GAT TTTGCTTAAA TATTGTAATT TTTCACCC 443 

TTCATTCGTC CCTTTGTCAA ATTTACATTT GACAGGACGC CAATGCGATA CG ATG TTG 5 07 

TACCGCTATT TTCAGCATTG. TATATTAAAC TGTACAGGTG TAAGTTGCAT TTGCCAGC 5<S5 

TGAAATTGTG TAGTCGTTTT * CTTTACGATT TAATANCAAG TGG CGG AGC A GTGCCCCA S2 3 



AG CN AAAAAA AAAAAAAAAA 64 J 
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<aio» if* 
<an> us 

<313> Zea. mays 
<400> 7 



Arg Glu ly. He Ma Ma Leu Lys Gly Gly Val Tyr Leu Ala Asp ciy 

1 s 10 

Aso Glu Thr Val P=o Val Ma Ser Ala Gly Tyr Met Cys Ala Lys Gly 
20 iS 

Trp *rg Gly Lys Thr Arg Phe Ser Pro Ala Gly ser Lys Thr Tyr Val 
Ars Glu Tyr Ser Ki* Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly 



50 



S3 



Thr Gin ser Gly Ala His Val Asp He M« Gly Asa Phe Ala Leu lie 
S3 70 7 



Glu Asp 



sp Val lie Ars Xle Ala Ala Cly Ala Thr Gly Glu Glu XX. Gly 



85 



— t oh^ T~t> Ser Glu L.ys He Lys 

Gly Aso Gin Val Tyr Ser As? xle Phe Ly- * ^ 

100 105 



Levi Lys Leu 
IIS 
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<2io> aj^ 

<211> 616 
<212> cDNA 

<Z13? Kcuxospora crasaa 
<40Q> 3 



GGTGG CG AAG 


ACGAMGGCGG 


AAGTTGGAGG 


CTAACGAGAA 


TGACKCTCGG 


50 

• 


AGATGGATCT 


ACCCTCTAGA 


GACACCACTA 


CC5TTTGCACC 


CAGCCTCAAG 


ioo 


GT^rTACNGTT 


TNTATGGGTA 


GGAAGCCGAC 


GGAGCGAGCC 


TACATCTATC 


ISO 


TGGCGCCCGA 


TCCCGGGACG 


ACAACGCATC 


TTTAGATGAC 


GATCGATACG 


200 


ACTTTGACTN 


AGGGGCACAT 


TGACCACGGT 


GTGATTTTGG 


GCGAAGGCGA 


250 


TGGCACAGTG 


AACCTTATGA 


GTTTGGGGTA 


CCTGTGGAAT 


AAGGCGYGGA 


aoo 


AAATGAAGAG 


ATACAATCCT 


GCGGGCTCAA AAATAACCGT 


GGTCGAGATG 


250 


CCGCATGAAC 


CAGAACGGTT 


CAATCCGAGA 


GGAGGGCCGA ATACGGCGGA 


500 


CTTAAATATG 


TAGAAAAGGT 


TG AAATTT AT 


GAAGAGTAAT 


TAAATACGGC 


sso 


ACATAGGTTA 


CTCAATAGTA 


TGACTAATTA 


AXAAAAAATT 


TTTTTTCTAA 


GOO 


AAAAAAAAAA 


AAAAAA 








516 
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<210> 9jLr 

<211> 1562 

<212> genomic DMA 

<213> Arabidopsis thai i ana 

<400> 9 



ATGAAAAAAA TATCTTCACA TTATTCGGTA GTCATAGCGA- TACTCGTTGT. SO 

GGTGACGATG ACCTCGATGT GTCAAGCTGT GGGTAGCAAC GTGTACCCTT 10 0 

TGATTCTGGT TCCAGGAAAC GGAGGTAACC AG CTAGAGGT ACGGCTGGAC ISO 

AGAGAATACA AGCCAAGTAG TGTCTGGTGT AGCAGCTGGT TAT AT CCG AT 20 O 

T C AT AAG AAG AGTGGTGGAT GGTTTAGGCT ATGGTTCGAT GCAGCAGTGT 2S0 

TATTGTCTCC CTTCACCAGG TGCTTCAGCG ATCGAATGAT GTTGTACTAT 3 00 

GAcr.frrna.TT tgcatcatta ccaaaatgct cctggtgtcc aaacccgogt 

TCCTCATTTC GGTTCGACCA AATCACTTCT ATACCTCGAC CCTCGTCTCC 
GGTTAGTACT TTCCAAGATA TATCATTTTG GGACATTTGC ATAATGAACA 
AAATAGACAT AAATTTGGGG GATTATTGTT ATATCAATAT CCATTTATAT 500 
GCTAGTCGGT AATGTGAGTG TTATGTTAGT ATAGTTAATG TGAGTGTTAT 
GTGATTTTCC ATTTTAAATG AAGCTAGAAA GTTGTCGTTT AATAATGTTG 
CTATGTCATG AGAATTATAA GGACACTATG TAAATGTAGC TTAATAATAA 
GGTTTGATTT GCAGAGATGC CACATCTTAC ATGGAACATT TGGTCAiACC 
TCTAGAGAAA AAATG CGGGT ATGTTAACGA CCAAACCATC CTAGGAGCTC 
CATATGATTT CAGGTACGGC CTC-GCTGCTT CGGGCCACCC GTCCCGTGTA 
GCCTCACAGT TCCTACAAGA CCTCAAACAA TTGGTGGAAA AAACTAGCAC- eSO 
CGAGAACGAA GGAAAGCCAG TGATACT CCT CTCCCATAGC CTAGGAGGAC 
TTTTCGTCCT CCATTTCCTC AACCGTACCA CCCCTTCATG GCGCCGCAAG 
TACATCAAAC ACTTTGTTGC ACTCGCTGCG C CAT GGGGTG GGACGATCTC 
TCAGATGAAG ACATTTGCTT CTGGCAACAC ACTCGGTGTC CCTTTAGTTA 
ACCCTTTGCT GGTCAGACGG CATCAGAGGA CCT CCG AG AG XAACCAATGG 
CTACTTCCAT CTACCAAAGT GTTTCACGAC AGAACTAAAC CGCTTGTCGT 
AACTCCCCAG GTTAACTACA CAGCTTACGA GATGGATCGG TTTTTTGCAG 

rTT ~~~<~ nc?ASACAAG AGTGTTGCCT 
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TTAACAGAGG AGCTGATGAC 
GAGAGGAGTT GATACACCGG 
ATAAGCAACC AGAGATTAAG 
GCGAGCTTAG CAGCTTTGAA 
TGGAGT IT C G CATACAT CT A 
TTATGAAGCA GATTTCAATT 
GTCAATGAAT GA 



TCCGGGAGTG CCAGTCACTT 
AGGTTTTGAT GTATGGAAAA 
TATGGAGATG GAGATGGGAC 
AGTCGATAGC TTGAACACCG 
TACTTAAAGA CGAGATCGCA 
ATTAATTATG AATTAGCCAA 



GCATATATGG ±2 00 

GGAGGATTCG i 3S0 

GGTTAATTTG 14 0 0 

TAGAGATTGA 1430 

CTTAAAGAGA i 500 

TGTTAATGCC . 15 5 0 

1562 
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<21Q> lojk' 
3S9S 

<212> gcnorr.ic DXA 

<213> Arabidopsis thali&rva 

<400> 10 



ATGGGAGCGA 
TTTTTTCTTG 
TTCACGGCGA 
TCGACGCAGC 
CG ACTTC AAT 
ATCTTCATTT 
AATTCCAAGC 
TCCTTCATTA 
AAAAC AG CTO 
TAGCGCAATG 
CAAGTTTTTC 
CTTGTGCATT 
TGGTTTACAG 
TAGATCCTTA 
AGTGGTCTTT 
TTTCGGATTT 
TGTGATATAA 
TTTCTACTGT 
GAACCAAATG 
C AAVTTGG AA 
TCAGGCTAAT 
TCTGGTCGTC 
CCGTGGCGGC 
TCAGATACTT 
TTGAAGTGG C 
CCTACTATCC 
TGTTGTGACT 
GAGCTCCTCT 
GTAACGTTTG 
TTTAARTariT 
1TGAAAGTATT 
TAGATCTTGA 
GTCACCGGCC 
TTCAGATTAT 

cxrrrTATTT 

TGACATGCGC 
GGTTGTTGTC 
AAG AATTG C A 
TGCAAAGAAA 
CAAAATATTC 
ACTAGCGGTT 
CCAAGAATGT 
TACAGAAACA 
CCCTTTTGTC 
G C AAT AG AAG 
GAAGTACGTA 
TTGCTGGCTT 
TGTTCATATG 
CCTTATTATT 
■GTTTACAGTT 
ATTCTCTAGT 
T CTTTGTQ AA. 
TGATGACCCT 
AAAATGTATT 
ATTCTCAATA 



ATTCGAAATC 
ATTTGCGGTG 
CTACrCGAAG 
TACGAGCGTG 
CCGCTCGACC 
CCTTCGCTCC 
GAAATATAGC 
GTCAACAGTG 
ACTCGGCGAG 
AATGXGTAAT 
AG AGTGCTC A 
GTGATTCTTT 
CTTCTTTCTG 
TAATCAAACA 
CAGCCATCAC 
TTCTTTCTTT 
TATGGCTAAG 
CTGGAAAGAG 
CAATTGTCGC 
GAGCGTGACC 
GT CTTTTATC 
TTCCTTTTTG 
CCTTCTATAG 
TCTGG AATGG 
TTGATCAGCA 
TTAAGTTACC 
TACTGGATTG 
TCTTGGTTCT 
GCCTTCCTGT 
TGATJLTCAAC 
ACTTTTGTTA 
AGTGCTAGTT 
TTAGCTAATA 
TAXGGTAGAC 
TAATAGGCTA 
TTCTCATGTT 
CAATTCTTTT 
AGGGTGATAA 
GATAAGCGCG 
TGGCTGGCCG 
AGACTCTGTA 
TCACTCTCAT 
GCTCTAGTCA 
TTTCACAGCC 
ACTATGACCC 
CCTTTCTTTG 
CTTGTACGTC 
CTTTGT CTTT 
GATTATCAGT 
ATGAATGCAA 
TTGTTTTGAC 
TTATATATAA 
GTTTTTAATC 
TTGCATATAT 
TCACATTATG 



AGTAACGGCT 
GCCGAACTGC 

GTCGATCCTT 
TCGTATGGCT 
TTATTCTGTC 
AATGAAGCAT 
ACGCTTCTGA 
TCTTTCCCAT 
TAGTCTGCGC 
ATAGTAGTTA 
TGGTTGTTGC 
CTGTCAACTG 
GACCATCCCG 
AG AATTGG AT 
TGAGTTTTCT 
TTCATTAATT 
TGGCTTAAGT 
XGTTCCATAC 
TTTACTTTCA 
TTCTCTTTTT 
CAGGTTGAGC 
TATTTGCCCA 
CTGAGGCTAG 
TATCCATGCT 
ATTTTATTTT 
AGCTCGATAC 
GTTGAGGCAA 
TTCTGAGGTG 
CACCTCTTAT 
ATTGAACTGC 
ATCAAAGAAC 
CAACCAAACC 
TTTAAGTTGA 
TGATTTGTTT 
TTTTGTTGGC 
GCGTCGTCAT 
CACATTCTGG 
TATACCACTG 
ACAAATATTA 
TATGCAACTG 
ATTTCGTTCC 
ACATGACCAG 
CGTGAACTAG 
AGATAGCAAG 
TGATAAGAAA 
AAATTGTTTT 
CTTACTATAA 
TCTCTCCTTA 
AAGGGGGTAX 
TAATAGCGTC 
CATGCTAACT 
CTCTGACTCC 
GGTGGTCATC 
CGTTGACTTT 



TCCTTCACCG 
GGTGGAGGAT 
TAATCATXCC 
GACTGTCCAT 
AGACACCACT 
GGTCGAGTCA 
GTCTCGTCTC 
ATCTGAGTTT 
CG CTTTTGGT 
TTTTTATTCA 
GAAAATGTTA 
TTACTGATCG 
CTGGTTTAAG 
AGTGTAAGTC 
CCAGGTTACA 
TCAATTTGAT 
TGGTCAATTT 
GGTGTGTTGA 
GATTGGAGAT 
CAAGCTCAAG 
ATGTAAGATA 
TTTGAAACTG 
TTCAATGGGT 
AAATTGCACC 
TATTTCGCTG 
TTCTCTAATT 
CTGATTTGTT 
TCAAATCTAC 
ACCTCTGACT 
MCTCACTGG 
TGTACGCGAT 
AXATTGTGGG 
ACATGTACAC 
CAAGAAACTT 
ATTGAAATCA 
AAGGCTTCAG 
TGTGGCTTAT 
ACGCATTTTT 
TGATGAAGAG 
TTAACATTGA 
TAACACTAAC 
TTTGAXGTGT 
CATGGAATGT 
CAGATGGGAC 
AGGATGTTAC 
TATTGCTCAT 
GTTTAAATCT 
GAAACAAGTA 
TATTATGGAA 
TTTAGTTGAT 
AATTTTGTTT 
ATACTTTTCA 
TTGGGAGAGA 
TAAAGACAGA 
GTTATTATAT 



TCATCGCCGT 
GAGACCGAGT 

ACACTCCGTT 
AAGGTCCGTG 
CTTGTTGATG 
TCTTATTGAT 
AGAGTCATAT 
TCGCTAAATG 
ACTAGATCTG 
GGTCATTTTA 
ACGTGATGGA 
TGTATGGTGC 
ACGG CCTGAC 
TAACAGGTAG 
ATCATCTTGT 
TCAGGTCCTC 
CTTTGGTATA 
TGTCACCAAC 
TTAGTCCTTA 
AG CT AAG AG C 
CTTTAAAACT 
AATAATGTCT 
AAAACATTAT 
TTGGTACCGG 
GGGGGAGTTA 
GTTG ATTTAG 
TCTCTCTGGT 
TCTCTTTAGT 
ATTTTCCTTT 
ATGGTATCTG 
TAG TAT AC CT 
TGATTTAGTT 
TGACTGAAAT 
TGTG AC AT AT 
GGAACTGCTC . 
GCCATTTTCA 
CTGGGGGTGC 
GAATATCAAT 
AATTCCTTCC 
AAAAGTTTCA 
ATCCATCAGT 
GGCCTTCCCA 
TCTTTTCAAA 
AC C AGTT AAA 
CG AT CATC AC 
CTATATCAAT 
TAATCAGAAA 
TGTCTTTTTC 
TGATTCTCTC 
TTCTAGCAAA 
GGTTGTATCA 
CCACCTATAA 
GGTATGATGC 
TCCCCATTTG 
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GTTTGCAATA TC TTTTT GAA SScTGTCT GTCATAGGTT 

CCTATTAAGC GTTAAAGGTA "^fEJI ^SScTGATA ATTGGATCAT 
GGTTATTACr TTGCCCCAAG WGCAAACCT ^g^- TCA AGGTAATTTT 
CACGGATATC ATTXATGAAA CTGAAGGTrC j^^^g xaTC AGTCTA 
CCCCAATGGC ACAAGTAAAA CAC-AAGGCA ^.^^^ ^CTAAAATTT 
GTGGCATCTT ATCTCACTTG CATAAGCAAA J™™^ CAGTGGCTTA 

aagtactttt ttatcattcc. TrrrGAGCTT c ^gataact 

AAGTGGGAAG AGGTGTTGCA TOAAACATGA ^J^^ WGAGTAGTC 
ACCAAAACAA AACTAACCCA TTTCTGAATT ^tCTvtcrAO TTCATATCTT 
£££££ AAAATTTGTT |^¥SSSc G^ACGCWGA 

GATTGTGCAA TATCTGCAGG ^^I^flJ GAA&TTGGTT TTGAAATTAT 
CCtATAACTG GSGATGAGAC SStGCTTC TGGAAC AC TG 

CTTCVTGCAA ACTACTGAAG TOACtCTCCG CTACTTTTAT 

CTTCCTATGT TCTCTASTAC ACTGwAAXAX W CTCTCTTGGT 
TGATTATOAA ATTGATCCCT ^ATAGGTACC «£££ TGSC TCCCCAGGTA 
GCAAGAATTG GCTCGGACCT c ££cTTTAA GTGTACOT 

CTCTTTTTTA GTTCCTCACC ^*J*2£J ^CTTTCTAA AAATCATATA 
CTGGTTATOT GTTGATTCAC CTCCAAXTTG ™££ OAfiA TTCTCATTCG 
TCTCTGTACT CCTCAAGAAC TTGT^T^I SgCGACGTA CATGTGGAAC 
GAAAATAAAA CAACAGCCAG ^=*=GATGG ^A-TAGCTAA CATGACAAAA 
• TAAATGTTG A TCATGAGCAT ^JS^t GAAGACTCTG AGAGCATTCC 
GCACCAAGGG TTAAGTACAT IaaIaGTGGG TATTAA 

GGGGAAGAGA ACCCCAGTCT GjOWCITOA 
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<210> lltr 
<211> 709 
<2\2> COWA 
<2il» conaco 
<400> IX 

CTGGGGCCAA AAGTG AACAT AACAACGACA CCACAGTCAG AG C ATGATGT 
TCAGATGTAC AAGTGCATCT AAATATAGAG CATCAACATG GTGAAGATAT. 
CATTCCCAAT ATGACAAAGT TACCTACAAT GAAGTACATA ACCTATTATO 
AGGATTCTCA AAGTTTTCCA GGGACAAGAA CAGCAGTTTG GGAGCTTGAT 
AAAGCAAATC ACAGG AACAT TGTCAGATCT CCAGCTTXGA TGCGGGAGCT 
GTGGCTTGAG ATGTGG CATG ATATTCATCC TGATAAAAAG TCCAAGTTTG 
TTACAAAAGG TGGTGTCTGA TCCTCACTAT TTTCTTCTAT AAATGTTTGA 
GTTTGTATTG ACATTGTAAG * TATTGCAACA AAAAGCAAAG CGTGGGCCTC 
TGAGGGATGA GGACTGCTAT TGGGATTACG GGAAAGCTCG ATGTGCATGO 
GCTGAACATT GTGAATACAG GTTAGAATAT TCAAATTATA TTXTGCAAAA 
TATTCTCTTT TTGTGTATTT AGGCCACCTT TCCCCGGTCA CAACGATGCA 
GATATGTATT CGGGGATGTT CACCTGGGAC AGAGTTGCAG ATTGAAGAGT 
rCTACATCTC ACATCCTGTC ACACTATGTG TGATATTTAA GAAACTTTGT 
TTGGCGGAAC AACAAGTTTG CACAAACATT TGAAGAAGAA AG CG AAATG A 
TTCAGAGAC 
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